首页> 外文会议>Audio Engineering Society Convention;Audio Engineering Society. >Evaluating the Influence of Source Separation Methods in Robust Automatic Speech Recognition with a Specific Cocktail-Party Training
【24h】

Evaluating the Influence of Source Separation Methods in Robust Automatic Speech Recognition with a Specific Cocktail-Party Training

机译:用特定鸡尾酒党培训评估源分离方法对鲁棒自动语音识别的影响

获取原文

摘要

Automatic Speech Recognition (ASR) allows a computer to identify the words that a person speaks into a microphone and convert it to written text. One of the most challenging situations for ASR is the cocktail-party environment. Although source separation methods have already been investigated to deal with this problem, the separation process is not perfect and the resulting artifacts pose an additional problem to ASR performance in case of using separation methods based on time-frequency masks. Recently, the authors proposed a specific training method to deal with simultaneous speech situations in practical ASR systems. In this paper, we study how the speech recognition performance is affected by selecting different combinations of separation algorithms both at the training and test stages of the ASR system under different acoustic conditions. The results show that, while different separation methods produce different types of artifacts, the overall performance of the method is always increased when using any cocktail-party training.
机译:自动语音识别(ASR)允许计算机识别人们在麦克风中发言并将其转换为书面文本的单词。 ASR中最具挑战性的情况之一是鸡尾酒会的环境。尽管已经研究了源分离方法来处理这个问题,但分离过程并不完美,并且所得到的工件在使用基于时频掩模的分离方法的情况下对ASR性能提出了额外的问题。最近,作者提出了一种特定的培训方法来处理实际ASR系统中的同时语音情况。在本文中,我们研究了通过在不同的声学条件下选择ASR系统的训练和测试阶段的分离算法的不同组合来研究语音识别性能的影响。结果表明,虽然不同的分离方法产生不同类型的伪影,但在使用任何鸡尾酒党培训时,该方法的整体性能总是增加。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号