Large vocabulary continuous speech recognition using HTK


Abstract

HTK is a portable software toolkit, developed by the Cambridge University Speech Group, for building speech recognition systems using continuous density hidden Markov models. One particularly successful type of system uses mixture density tied-state triphones. We have used this technique for the 5k/20k word ARPA Wall Street Journal (WSJ) task. We have extended our approach from word-internal, gender independent modelling to decision tree based state clustering, cross-word triphones and gender dependent models. Our current systems can be run with either bigram or trigram language models using a single pass dynamic network decoder. Systems based on these techniques were included in the November 1993 ARPA WSJ evaluation, and gave the lowest error rate reported on the 5k word bigram, 5k word trigram and 20k word bigram "hub" tests and the second lowest error rate on the 20k word trigram "hub" test.
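
The distinction the abstract draws between word-internal and cross-word triphones can be illustrated with a minimal sketch. It is not taken from the paper or from HTK itself: the function names, the example phone strings and the handling of utterance edges are assumptions made for illustration. Only the `l-p+r` label form follows the usual HTK triphone naming convention.

```python
from typing import List


def word_internal_triphones(words: List[List[str]]) -> List[str]:
    """Label each phone using left/right context from inside its own word only.

    Phones at word edges lose the missing context, giving biphone labels
    such as "s+p" or "iy-ch" (or a plain monophone for one-phone words).
    """
    labels = []
    for phones in words:
        for i, phone in enumerate(phones):
            left = f"{phones[i - 1]}-" if i > 0 else ""
            right = f"+{phones[i + 1]}" if i < len(phones) - 1 else ""
            labels.append(f"{left}{phone}{right}")
    return labels


def cross_word_triphones(words: List[List[str]]) -> List[str]:
    """Label each phone using context that spans word boundaries.

    Simplifying assumption: the utterance is treated as one long phone
    string, so only the utterance edges lack context. A real system would
    typically place silence models there instead.
    """
    flat = [phone for phones in words for phone in phones]
    return word_internal_triphones([flat])


# Hypothetical two-word utterance ("speech to") with illustrative phone symbols.
utterance = [["s", "p", "iy", "ch"], ["t", "uw"]]
print(word_internal_triphones(utterance))
# ['s+p', 's-p+iy', 'p-iy+ch', 'iy-ch', 't+uw', 't-uw']
print(cross_word_triphones(utterance))
# ['s+p', 's-p+iy', 'p-iy+ch', 'iy-ch+t', 'ch-t+uw', 't-uw']
```

Cross-word labelling produces more distinct context-dependent units than word-internal labelling, which is one reason state tying (for example by decision tree clustering, as the abstract mentions) is needed to keep the number of trainable parameters manageable.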

Bibliographic details

Source: 1994, pp. II.125-II.128
Authors: Woodland, P.C.; Odell, J.J.
Original format: PDF
Classification (CLC): Radio electronics; telecommunications technology

