首页> 外国专利> System and method for increasing recognition rates of in-vocabulary words by improving pronunciation modeling

System and method for increasing recognition rates of in-vocabulary words by improving pronunciation modeling

机译:通过改进语音建模来提高词汇中单词识别率的系统和方法

摘要

The present disclosure relates to systems, methods, and computer-readable media for generating a lexicon for use with speech recognition. The method includes overgenerating potential pronunciations based on symbolic input, identifying potential pronunciations in a speech recognition context, and storing the identified potential pronunciations in a lexicon. Overgenerating potential pronunciations can include establishing a set of conversion rules for short sequences of letters, converting portions of the symbolic input into a number of possible lexical pronunciation variants based on the set of conversion rules, modeling the possible lexical pronunciation variants in one of a weighted network and a list of phoneme lists, and iteratively retraining the set of conversion rules based on improved pronunciations. Symbolic input can include multiple examples of a same spoken word. Speech data can be labeled explicitly or implicitly and can include words as text and recorded audio.
机译:本公开涉及用于生成与语音识别一起使用的词典的系统,方法和计算机可读介质。该方法包括:基于符号输入来过度生成潜在的发音;在语音识别上下文中识别潜在的发音;以及在词典中存储所识别的潜在的发音。潜在语音的过度生成可能包括为短字母序列建立一组转换规则,基于该组转换规则将部分符号输入转换为许多可能的词汇发音变体,在一个加权的模型中对可能的词汇发音变体进行建模网络和音素列表的列表,并基于改进的发音来迭代地重新训练一组转换规则。符号输入可以包括相同口语单词的多个示例。语音数据可以显式或隐式标记,并且可以包含单词(如文本)和录制的音频。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号