【24h】

Acoustic Language Model Classes for A Large Vocabulary Continuous Speech Recognizer

机译:大型词汇连续语音识别器的声学语言模型类

获取原文

摘要

In a maximum a posteriori probability approach to speech recognition stochastic n-gram language models are used for the estimation of a word sequence's a priori probability. In any practical implementation of a large vocabulary speech recognition system the language model acts as a hypotheses filter that has to differ between candidate words with similar acoustic evidence. For that purpose, the combination of word based and class based language models is attractive, because it allows to fall back to the more reliable estimates of the class based model in case of sparse training data. However, class language models can differ between words from the same class only in terms of a priori probability. To improve the discriminative power for words with similar acoustic score, it is therefore useful to put similar sounding words into different classes.
机译:在最大程度上,语音识别随机n元语法模型的后验概率方法用于估计单词序列的先验概率。在大型词汇语音识别系统的任何实际实现中,语言模型都充当假设过滤器,该过滤器在具有相似声学证据的候选单词之间必须有所不同。为此,基于单词的语言模型和基于类的语言模型的组合具有吸引力,因为在稀疏的训练数据的情况下,它可以退回到基于类的模型的更可靠的估计。但是,仅在先验概率方面,课堂语言模型可以在来自同一课堂的单词之间有所不同。为了提高具有相似声学分数的单词的判别能力,因此将相似发音的单词放入不同的类别很有用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号