首页> 外文会议>Workshop on speech and language processing for assistive technologies >Improving Continuous Sign Language Recognition: Speech Recognition Techniques and System Design
【24h】

Improving Continuous Sign Language Recognition: Speech Recognition Techniques and System Design

机译:改进连续手语识别:语音识别技术和系统设计

获取原文

摘要

Automatic sign language recognition (ASLR) is a special case of automatic speech recognition (ASR) and computer vision (CV) and is currently evolving from using artificial lab-generated data to using 'real-life' data. Although ASLR still struggles with feature extraction, it can benefit from techniques developed for ASR. We present a large-vocabulary ASLR system that is able to recognize sentences in continuous sign language and uses features extracted from standard single-view video cameras without using additional equipment. ASR techniques such as the multi-layer-perceptron (MLP) tandem approach, speaker adaptation, pronunciation modelling, and parallel hidden Markov models are investigated. We evaluate the influence of each system component on the recognition performance. On two publicly available large vocabulary. databases representing lab-data (25 signer, 455 sign vocabulary, 19k sentence) and unconstrained 'real-life' sign language (1 signer, 266 sign vocabulary, 351 sentences) we can achieve 22.1% respectively 38.6% WER.
机译:自动手语识别(ASLR)是自动语音识别(ASR)和计算机视觉(CV)的特例,目前正从使用人工实验室生成的数据发展为使用“真实”数据。尽管ASLR仍在特征提取方面苦苦挣扎,但它可以从为ASR开发的技术中受益。我们提出了一种大型词汇的ASLR系统,该系统能够识别连续手语的句子,并使用从标准单视角摄像机提取的功能,而无需使用其他设备。研究了ASR技术,例如多层感知器(MLP)串联方法,说话人自适应,语音建模和并行隐马尔可夫模型。我们评估每个系统组件对识别性能的影响。在两个公开可用的大词汇量上。代表实验室数据(25个签名者,455个签名词汇,19k句子)和不受约束的“现实生活”手语(1个签名者,266个签名词汇,351个句子)的数据库,我们可以分别实现22.1%的WER和38.6%的WER。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号