首页> 美国政府科技报告 >Cepstral Domain Talker Stress Compensation for Robust Speech Recognition
【24h】

Cepstral Domain Talker Stress Compensation for Robust Speech Recognition

机译:用于鲁棒语音识别的倒谱域语音应力补偿

获取原文

摘要

Automatic speech recognition algorithms generally rely on the assumption that for the distance measure used, intraword variabilities are smaller than interword variabilities so that appropriate separation in the measured space is possible. As evidenced by degradation of recognition performance, the validity of such an assumption decreases from simple tasks to complex tasks, from cooperative talkers to casual talkers, and from laboratory talking environments to practical talking environments. This paper presents a study of talker-stress-induced intraword variability, and an algorithm that compensates for the systematic changes observed. The study is based on hidden Markov models trained by speech tokens spoken in various talking styles. The talking styles include normal speech, fast speech, loud speech, soft speech, and talking with noise injected through earphones; the styles are designed to simulate speech produced under real, stressful conditions. Cepstral coefficients are used as the parameters in the hidden Markov models. Substantial reduction of error rates had been achieved when the cepstral domain compensation techniques were tested on the 'simulated stress' speech database.

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号