首页> 外文期刊>IEE Proceedings. Part K >Continuous Mandarin speech recognition for Chinese language with large vocabulary based on segmental probability model
【24h】

Continuous Mandarin speech recognition for Chinese language with large vocabulary based on segmental probability model

机译:基于分段概率模型的大词汇量汉语连续汉语语音识别

获取原文
获取原文并翻译 | 示例
       

摘要

The author presents a study of large- vocabulary continuous Mandarin speech recognition based on a segmental probability model (SPM) approach. SPM was found to be very suitable for recognition of isolated Mandarin syllables especially considering the monosyllabic structure of Chinese language. To extend the application of the model to continuous Mandarin speech recognition, a concatenated syllable matching (CSM) algorithm in place of the conventional Viterbi search algorithm is first introduced. Also, to utilise the available training material efficiently, a training procedure is proposed to re-estimate the SPM parameters using the maximum a posteriori (MAP) algorithm. A few special techniques integrating acoustic and linguistic knowledge are developed further to improve the performance step by step. Preliminary experimental results show that the final achievable rate is as high as 91.62/100, which indicates a 18.48/100 error rate reduction and more than three times faster than the well studied subsyllable-based CHMM.
机译:作者提出了一种基于分段概率模型(SPM)方法的大词汇量连续汉语普通话语音识别的研究。发现SPM非常适合识别孤立的普通话音节,特别是考虑到汉语的单音节结构。为了将模型的应用扩展到连续的普通话语音识别,首先引入了串联音节匹配(CSM)算法来代替传统的维特比搜索算法。另外,为了有效地利用可用的培训材料,提出了一种培训程序,以使用最大后验(MAP)算法重新估计SPM参数。进一步开发了一些整合声学和语言知识的特殊技术,以逐步改善性能。初步实验结果表明,最终可达到的比率高达91.62 / 100,这表明错误率降低了18.48 / 100,并且比经过充分研究的基于音节的CHMM快三倍以上。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号