首页> 外文会议>Audio Engineering Society convention >Evaluating the Influence of Source Separation Methods in Robust Automatic Speech Recognition with a Specific Cocktail-Party Training
【24h】

Evaluating the Influence of Source Separation Methods in Robust Automatic Speech Recognition with a Specific Cocktail-Party Training

机译:评估源分离方法在具有特定鸡尾酒会训练的鲁棒自动语音识别中的影响

获取原文

摘要

Automatic Speech Recognition (ASR) allows a computer to identify the words that a person speaks into a microphone and convert it to written text. One of the most challenging situations for ASR is the cocktail-party environment. Although source separation methods have already been investigated to deal with this problem, the separation process is not perfect and the resulting artifacts pose an additional problem to ASR performance in case of using separation methods based on time-frequency masks. Recently, the authors proposed a specific training method to deal with simultaneous speech situations in practical ASR systems. In this paper, we study how the speech recognition performance is affected by selecting different combinations of separation algorithms both at the training and test stages of the ASR system under different acoustic conditions. The results show that, while different separation methods produce different types of artifacts, the overall performance of the method is always increased when using any cocktail-party training.
机译:自动语音识别(ASR)允许计算机识别一个人在麦克风中说出的单词,并将其转换为书面文本。鸡尾酒会环境是ASR最具挑战性的情况之一。尽管已经研究了源分离方法来解决此问题,但是在使用基于时频掩码的分离方法的情况下,分离过程并不完美,并且由此产生的伪影对ASR性能造成了额外的问题。最近,作者提出了一种特殊的训练方法来处理实际ASR系统中的同时语音情况。在本文中,我们研究了在不同声学条件下,通过在ASR系统的训练和测试阶段选择不同的分离算法组合,如何影响语音识别性能。结果表明,尽管不同的分离方法会产生不同类型的伪像,但在使用任何鸡尾酒会训练时,该方法的总体性能始终会得到提高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号