...
首页> 外文期刊>EURASIP journal on advances in signal processing >Likelihood-Maximizing-Based Multiband Spectral Subtractionfor Robust Speech Recognition
【24h】

Likelihood-Maximizing-Based Multiband Spectral Subtractionfor Robust Speech Recognition

机译:基于似然最大化的多频带谱减法用于鲁棒语音识别

获取原文
获取原文并翻译 | 示例
           

摘要

Automatic speech recognition performance degrades significantly when speech is affected by environmental noise. Nowadays, themajor challenge is to achieve good robustness in adverse noisy conditions so that automatic speech recognizers can be used in realsituations. Spectral subtraction (SS) is a well-known and effective approach; it was originally designed for improving the qualityof speech signal judged by human listeners. SS techniques usually improve the quality and intelligibility of speech signal whilespeech recognition systems need compensation techniques to reduce mismatch between noisy speech features and clean trainedacoustic model. Nevertheless, correlation can be expected between speech quality improvement and the increase in recognitionaccuracy. This paper proposes a novel approach for solving this problem by considering SS and the speech recognizer not as twoindependent entities cascaded together, but rather as two interconnected components of a single system, sharing the common goalof improved speech recognition accuracy. This will incorporate important information of the statistical models of the recognitionengine as a feedback for tuning SS parameters. By using this architecture, we overcome the drawbacks of previously proposedmethods and achieve better recognition accuracy. Experimental evaluations show that the proposed method can achieve significantimprovement of recognition rates across a wide range of signal to noise ratios.
机译:当语音受到环境噪声的影响时,自动语音识别性能会大大降低。如今,主要的挑战是在不利的嘈杂条件下实现良好的鲁棒性,以便可以在现实中使用自动语音识别器。频谱减法(SS)是一种众所周知的有效方法。它最初旨在提高人类听众判断的语音信号质量。 SS技术通常可以提高语音信号的质量和清晰度,而语音识别系统需要补偿技术来减少嘈杂的语音特征与干净的已训练声学模型之间的不匹配。然而,可以预期语音质量的提高与识别准确性的提高之间的相关性。本文提出了一种解决此问题的新颖方法,它认为SS和语音识别器不是作为级联在一起的两个独立实体,而是作为单个系统的两个相互连接的组件,共同的目标是提高语音识别的准确性。这将结合识别引擎统计模型的重要信息,作为调整SS参数的反馈。通过使用这种体系结构,我们克服了先前提出的方法的缺点,并获得了更好的识别精度。实验评估表明,该方法可以在很大的信噪比范围内实现识别率的显着提高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号