...
首页> 外文期刊>International Journal of Engineering and Technology >Effect of states and mixtures in HMM model and Connected word Recognition of Profoundly deaf and hard of hearing speech
【24h】

Effect of states and mixtures in HMM model and Connected word Recognition of Profoundly deaf and hard of hearing speech

机译:HMM模型中状态和混合的影响以及严重聋哑和重听语音的关联单词识别

获取原文
           

摘要

It is a challenge for many years that how to fix the no. of states and no. of mixtures when HMM models are used for speech recognition. In this paper we have analysed that for hearing impaired speech that is partially intelligible to people who are speaking to them frequently and it is not understandable by the unfamiliar listeners. They suffer in many aspects like education and in public places to converse with the normal speakers. Since speech is unique most of the time normal speech itself could not be understand by others. If we develop the speech recognizer for their speech it will convert their unintelligible speech into intelligible speech. Speaker dependent connected digit recognition for this task using HTK tool kit is done and the average recognition accuracy obtained is 93%. Totally 10 speakers out of which 3 are hard of hearing and 7 are profoundly deaf are considered for this experiment. Then for isolated words, no. of mixtures are varied from 3 to 10 for each set of states such as 6, 7, 8, 9, 10 and recognition accuracy is verified for each case. When we varied beyond that there is no any significant change in recognition accuracy and so it is concluded that we can have mixture and state value as 10 for small vocabulary and the recognition performance for all types of feature is comparable to that of normal speech recognition. But irrespective of the state higher recognition is achieved at 8 or 9 or 10 mixer value for different type of feature and it can be concluded that, if we have the mixer value as 8 , 9 or 10 we can get reasonable results.
机译:多年来,如何解决这一难题是一个挑战。州,没有。 HMM模型用于语音识别时混合物的数量在本文中,我们已经分析了听力受损的语音,这种语音对于经常与他们说话的人来说是部分可理解的,而陌生的听众则无法理解。他们在许多方面受苦,例如教育和在公共场所与普通说话者交谈。由于语音在大多数情况下都是唯一的,其他人无法理解普通语音本身。如果我们为他们的语音开发语音识别器,它将把他们难以理解的语音转换为可理解的语音。使用HTK工具套件完成了与说话者相关的连接数字识别,并且平均识别准确度为93%。此实验考虑了总共10位说话者,其中3位听不清,7位严重失聪。然后对于孤立的单词,不。对于每种状态集(例如6、7、8、9、10),混合物的混合比例从3到10不等,并且每种情况下的识别精度均得到验证。当我们超出此范围时,识别准确度没有任何显着变化,因此得出的结论是,对于小词汇量,我们可以将混合值和状态值设为10,并且所有类型的特征的识别性能都可以与普通语音识别相媲美。但是,不管状态如何,对于不同类型的特征,在8或9或10的混合器值下都可以获得较高的识别度,并且可以得出结论,如果我们将混合器值设为8,9或10,则可以得到合理的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号