Improving Continuous Sign Language Recognition: Speech Recognition Techniques and System Design

机译：改进连续手语识别：语音识别技术和系统设计

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Automatic sign language recognition (ASLR) is a special case of automatic speech recognition (ASR) and computer vision (CV) and is currently evolving from using artificial lab-generated data to using 'real-life' data. Although ASLR still struggles with feature extraction, it can benefit from techniques developed for ASR. We present a large-vocabulary ASLR system that is able to recognize sentences in continuous sign language and uses features extracted from standard single-view video cameras without using additional equipment. ASR techniques such as the multi-layer-perceptron (MLP) tandem approach, speaker adaptation, pronunciation modelling, and parallel hidden Markov models are investigated. We evaluate the influence of each system component on the recognition performance. On two publicly available large vocabulary. databases representing lab-data (25 signer, 455 sign vocabulary, 19k sentence) and unconstrained 'real-life' sign language (1 signer, 266 sign vocabulary, 351 sentences) we can achieve 22.1% respectively 38.6% WER.

机译：自动手语识别（ASLR）是自动语音识别（ASR）和计算机视觉（CV）的特例，目前正从使用人工实验室生成的数据发展为使用“真实”数据。尽管ASLR仍在特征提取方面苦苦挣扎，但它可以从为ASR开发的技术中受益。我们提出了一种大型词汇的ASLR系统，该系统能够识别连续手语的句子，并使用从标准单视角摄像机提取的功能，而无需使用其他设备。研究了ASR技术，例如多层感知器（MLP）串联方法，说话人自适应，语音建模和并行隐马尔可夫模型。我们评估每个系统组件对识别性能的影响。在两个公开可用的大词汇量上。代表实验室数据（25个签名者，455个签名词汇，19k句子）和不受约束的“现实生活”手语（1个签名者，266个签名词汇，351个句子）的数据库，我们可以分别实现22.1％的WER和38.6％的WER。

著录项

来源
《Workshop on speech and language processing for assistive technologies》|2013年|41-46|共6页
会议地点
作者
Jens Forster; Oscar Koller; Christian Oberdoerfer; Yannick Gweth; Hermann Ney;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Continuous Sign Language Recognition; Large Vocabulary; ASR; Computer Vision; Recognition System;

机译：连续手语识别大词汇量ASR;计算机视觉;识别系统;

相似文献

外文文献
中文文献
专利

1. A Kinect-Based Sign Language Hand Gesture Recognition System for Hearing- and Speech-Impaired: A Pilot Study of Pakistani Sign Language [J] . Halim Zahid, Abbas Ghulam Assistive technology: the official journal of RESNA . 2015,第1期

机译：基于Kinect的听觉和言语障碍者手势手势识别系统：巴基斯坦手语的初步研究
2. Continuous sign language recognition: Towards large vocabulary statistical recognition systems handling multiple signers [J] . Oscar Koller, Jens Forster, Hermann Ney Computer vision and image understanding . 2015,第DECa期

机译：连续手语识别：面向处理多个签名者的大型词汇统计识别系统
3. An improved two-stage mixed language model approach for handling out-of-vocabulary words in large vocabulary continuous speech recognition [J] . Bert Reveil, Kris Demuynck, Jean-Pierre Martens Computer speech and language . 2014,第1期

机译：一种改进的两阶段混合语言模型方法，用于处理大词汇量连续语音识别中的词汇外单词
4. Improving Continuous Sign Language Recognition: Speech Recognition Techniques and System Design [C] . Jens Forster, Oscar Koller, Christian Oberdoerfer, Workshop on speech and language processing for assistive technologies . 2013

机译：提高持续行程语言识别：语音识别技术和系统设计
5. A multimodal fusion approach for automatic postal address recognition system using Optical Character Recognition (OCR) and Automatic Speech Recognition (ASR) techniques. [D] . Singh, Amriteshwar. 2011

机译：一种使用光学字符识别（OCR）和自动语音识别（ASR）技术的自动邮政地址识别系统的多模式融合方法。
6. Enhanced protein domain discovery by using language modeling techniques from speech recognition [O] . Lachlan Coin, Alex Bateman, Richard Durbin 2003

机译：通过使用语音识别中的语言建模技术来增强蛋白质结构域发现
7. Improving Continuous Sign Language Recognition: Speech Recognition Techniques and System Design [O] . Forster Jens, Koller Oscar, Oberdörfer Christian, 2013

机译：改进连续手语识别：语音识别技术和系统设计

Improving Continuous Sign Language Recognition: Speech Recognition Techniques and System Design

摘要

著录项

相似文献

相关主题

期刊订阅