...
首页> 外文期刊>Journal of computer sciences >A HYBRID METHOD FOR AUTOMATIC SPEECH RECOGNITION PERFORMANCE IMPROVEMENT IN REAL WORLD NOISY ENVIRONMENT | Science Publications
【24h】

A HYBRID METHOD FOR AUTOMATIC SPEECH RECOGNITION PERFORMANCE IMPROVEMENT IN REAL WORLD NOISY ENVIRONMENT | Science Publications

机译:真实噪声环境中自动语音识别性能改进的混合方法科学出版物

获取原文
           

摘要

> It is a well known fact that, speech recognition systems perform well when the system is used in conditions similar to the one used to train the acoustic models. However, mismatches degrade the performance. In adverse environment, it is very difficult to predict the category of noise in advance in case of real world environmental noise and difficult to achieve environmental robustness. After doing rigorous experimental study it is observed that, a unique method is not available that will clean the noisy speech as well as preserve the quality which have been corrupted by real natural environmental (mixed) noise. It is also observed that only back-end techniques are not sufficient to improve the performance of a speech recognition system. It is necessary to implement performance improvement techniques at every step of back-end as well as front-end of the Automatic Speech Recognition (ASR) model. Current recognition systems solve this problem using a technique called adaptation. This study presents an experimental study that aims two points, first is to implement the hybrid method that will take care of clarifying the speech signal as much as possible with all combinations of filters and enhancement techniques. The second point is to develop a method for training all categories of noise that can adapt the acoustic models for a new environment that will help to improve the performance of the speech recognizer under real world environmental mismatched conditions. This experiment confirms that hybrid adaptation methods improve the ASR performance on both levels, (Signal-to-Noise Ratio) SNR improvement as well as word recognition accuracy in real world noisy environment.
机译: >众所周知的事实是,当语音识别系统在类似于训练声学模型的条件下使用时,其性能会很好。但是,不匹配会降低性能。在不利的环境中,在实际环境噪声的情况下,很难提前预测噪声的类别,并且很难实现环境的鲁棒性。在进行了严格的实验研究后,我们发现,没有一种独特的方法可以清除嘈杂的语音并保持被真实的自然环境(混合)噪声破坏的质量。还观察到,仅后端技术不足以改善语音识别系统的性能。在自动语音识别(ASR)模型的后端和前端的每一步,都必须实施性能改进技术。当前的识别系统使用称为自适应的技术解决了这个问题。这项研究提出了一项针对两点的实验性研究,首先是实施混合方法,该方法将充分利用滤波器​​和增强技术的所有组合来尽可能清晰地说明语音信号。第二点是开发一种训练所有类型的噪声的方法,该方法可以使声学模型适应新的环境,这将有助于在现实环境中的不匹配条件下改善语音识别器的性能。该实验证实了混合自适应方法在两个级别上都改善了ASR性能,(信噪比)SNR的提高以及现实世界中嘈杂环境中的单词识别准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号