首页> 外文会议> >Large vocabulary continuous speech recognition using HTK
【24h】

Large vocabulary continuous speech recognition using HTK

机译:使用HTK的大词汇量连续语音识别

获取原文

摘要

HTK is a portable software toolkit for building speech recognition systems using continuous density hidden Markov models developed by the Cambridge University Speech Group. One particularly successful type of system uses mixture density tied-state triphones. We have used this technique for the 5 k/20 k word ARPA Wall Street Journal (WSJ) task. We have extended our approach from using word-internal gender independent modelling to use decision tree based state clustering, cross-word triphones and gender dependent models. Our current systems can be run with either bigram or trigram language models using a single pass dynamic network decoder. Systems based on these techniques were included in the November 1993 ARPA WSJ evaluation, and gave the lowest error rate reported on the 5 k word bigram, 5 k word trigram and 20 k word bigram "hub" tests and the second lowest error rate on the 20 k word trigram "hub" test.
机译:HTK是一种用于建立语音识别系统的便携式软件工具包,使用剑桥大学语音组开发的连续密度隐马尔可夫模型。一种特别成功的系统使用混合密度绑定状态三倍。我们使用此技术为5 k / 20 k Word Arpa Wall Street Journal(WSJ)任务。我们已经扩展了我们的方法,使用Word-Internal性别独立建模来使用基于决策树的状态群集,跨字三字和性别依赖模型。我们目前的系统可以使用单通动态网络解码器与BIGRAM或TRIGRAM语言模型一起运行。基于这些技术的系统包括在1993年11月的ARPA WSJ评估中,给出了5 K Word Bigram,5 K Word Trigram和20 K字Bigram“集线器”测试的最低错误率,以及第二个最低错误率20 K Word Trigram“Hub”测试。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号