首页> 外文会议>International conference on text, speech and dialogue >Recognition of the Electrolaryngeal Speech: Comparison Between Human and Machine
【24h】

Recognition of the Electrolaryngeal Speech: Comparison Between Human and Machine

机译:电喉语音的识别:人与机器的比较

获取原文

摘要

Automatic recognition of an electrolaryngeal speech is usually a hard task due to the fact that all phonemes tend to be voiced. However, using a strong language model (LM) for continuous speech recognition task, we can achieve satisfactory recognition accuracy. On the other hand, the recognition of isolated words or phrase sentences containing only several words poses a problem, as in this case, the LM does not have a chance to properly support the recognition. At the same time, the recognition of short phrases has a great practical potential. In this paper, we would like to discuss poor performance of the electrolaryngeal speech automatic speech recognition (ASR), especially for isolated words. By comparing the results achieved by humans and the ASR system, we will attempt to show that even humans are unable to distinguish the identity of the word, differing only in voicing, always correctly. We describe three experiments: the one represents blind recognition, i.e., the ability to correctly recognize an isolated word selected from a vocabulary of more than a million words. The second experiment shows results achieved when there is some additional knowledge about the task, specifically, when the recognition vocabulary is reduced only to words that actually are included in the test. And the third test evaluates the ability to distinguish two similar words (differing only in voicing) for both the human and the ASR system.
机译:由于所有音素都倾向于发声,因此自动识别喉咙语音通常是一项艰巨的任务。但是,使用强大的语言模型(LM)进行连续的语音识别任务,我们可以获得令人满意的识别精度。另一方面,孤立单词或仅包含几个单词的短语句子的识别带来了问题,因为在这种情况下,LM没有机会适当地支持该识别。同时,短短语的识别具有很大的实践潜力。在本文中,我们想讨论电喉语音自动语音识别(ASR)的性能较差,特别是对于孤立单词而言。通过比较人类和ASR系统所获得的结果,我们将尝试证明,即使人类也无法区分单词的身份,仅在发音上有所不同,而且总是正确的。我们描述了三个实验:一个代表盲目识别,即能够正确识别从超过一百万个单词的词汇中选择的孤立单词的能力。第二个实验显示了在对任务有一些额外的了解时,特别是当识别词汇仅减少到测试中实际包含的单词时所获得的结果。第三项测试评估了区分人类和ASR系统的两个相似单词(仅在发音上有所不同)的能力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号