首页> 外文期刊>Speech Communication >Assessing text-to-phoneme mapping strategies in speaker independent isolated word recognition
【24h】

Assessing text-to-phoneme mapping strategies in speaker independent isolated word recognition

机译:评估说话人独立的孤立单词识别中的文本到音素映射策略

获取原文
获取原文并翻译 | 示例
           

摘要

A phonetic transcription of the vocabulary, i.e., a lexicon, is needed in sub-word based speech recognition and text-to-speech systems. Decision trees and neural networks have successfully been used for creating lexicons on-line from an open vocabulary. We briefly review these methods and compare them in detail in the text-to-phoneme mapping task as part of a phoneme based speaker independent speech recognizer. The decision tree and neural network based methods were first evaluated in terms of phoneme accuracy and then in extensive speech recognition tests. American english dictionaries and speech databases were used in all experiments. The decision tree based method achieved high phoneme accuracies when the training material covered the test vocabulary well. In typical speech recognition tests, the recognition rates obtained using the decision tree based lexicons were close to the baseline that was obtained using accurate transcriptions. Although the lexicons obtained using neural networks resulted in somewhat lower baseline recognition rates, they provided slightly better results in generalization tests. Moreover, when the neural network based mappings were appended with a look-up table comprising the most likely vocabulary items, which would be the practical set-up, their performance increased significantly. The main advantage of neural networks over decision trees is their low memory consumption.
机译:在基于子词的语音识别和文本转语音系统中,需要词汇的语音转录,即词典。决策树和神经网络已成功用于从开放词汇表在线创建词典。我们简要回顾了这些方法,并在基于文本的音素映射任务中对它们进行了详细比较,作为基于音素的独立于说话者的语音识别器的一部分。首先根据音素准确性评估基于决策树和神经网络的方法,然后再进行广泛的语音识别测试。在所有实验中都使用了美式英语词典和语音数据库。当培训材料很好地覆盖了测试词汇时,基于决策树的方法获得了较高的音素准确性。在典型的语音识别测试中,使用基于决策树的词典获得的识别率接近使用准确转录获得的基线。尽管使用神经网络获得的词典导致较低的基线识别率,但它们在泛化测试中提供了更好的结果。此外,当基于神经网络的映射附加有包含最可能的词汇项目的查找表(这是实际的设置)时,其性能会大大提高。与决策树相比,神经网络的主要优势是低内存消耗。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号