首页> 外文会议>Audio Engineering Society convention >Evaluating the Influence of Source Separation Methods in Robust Automatic Speech Recognition with a Specific Cocktail-Party Training

【24h】

Evaluating the Influence of Source Separation Methods in Robust Automatic Speech Recognition with a Specific Cocktail-Party Training

机译：评估源分离方法在具有特定鸡尾酒会训练的鲁棒自动语音识别中的影响

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Automatic Speech Recognition (ASR) allows a computer to identify the words that a person speaks into a microphone and convert it to written text. One of the most challenging situations for ASR is the cocktail-party environment. Although source separation methods have already been investigated to deal with this problem, the separation process is not perfect and the resulting artifacts pose an additional problem to ASR performance in case of using separation methods based on time-frequency masks. Recently, the authors proposed a specific training method to deal with simultaneous speech situations in practical ASR systems. In this paper, we study how the speech recognition performance is affected by selecting different combinations of separation algorithms both at the training and test stages of the ASR system under different acoustic conditions. The results show that, while different separation methods produce different types of artifacts, the overall performance of the method is always increased when using any cocktail-party training.

机译：自动语音识别（ASR）允许计算机识别一个人在麦克风中说出的单词，并将其转换为书面文本。鸡尾酒会环境是ASR最具挑战性的情况之一。尽管已经研究了源分离方法来解决此问题，但是在使用基于时频掩码的分离方法的情况下，分离过程并不完美，并且由此产生的伪影对ASR性能造成了额外的问题。最近，作者提出了一种特殊的训练方法来处理实际ASR系统中的同时语音情况。在本文中，我们研究了在不同声学条件下，通过在ASR系统的训练和测试阶段选择不同的分离算法组合，如何影响语音识别性能。结果表明，尽管不同的分离方法会产生不同类型的伪像，但在使用任何鸡尾酒会训练时，该方法的总体性能始终会得到提高。

著录项

来源
《Audio Engineering Society convention》|2012年|p.125-132|共8页
会议地点
作者
Amparo Marti; Maximo Cobos; Jose J. Lopez;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类电声技术和语音信号处理;电声技术和语音信号处理;电声器件;
关键词

相似文献

外文文献
中文文献
专利

1. Automatic speech recognition in cocktail-party situations: A specific training for separated speech [J] . Marti A., Cobos M., Lopez J.J. The Journal of the Acoustical Society of America . 2012,第2aPta1期

机译：鸡尾酒会情况下的自动语音识别：针对单独语音的专门培训
2. Perceptual evaluation of blind source separation for robust speech recognition [J] . Leandro Di Persia, Diego Milone, Hugo Leonardo Rufiner, Signal processing . 2008,第10期

机译：盲源分离的感知评估可增强语音识别能力
3. Speech enhancement for robust automatic speech recognition: Evaluation using a baseline system and instrumental measures [J] . Moore Alastair H., Peso Parada Pablo, Naylor Patrick A. Computer speech and language . 2017,第nova期

机译：语音增强功能可实现强大的自动语音识别：使用基准系统和仪器测量进行评估
4. Evaluating the Influence of Source Separation Methods in Robust Automatic Speech Recognition with a Specific Cocktail-Party Training [C] . Amparo Marti, Maximo Cobos, Jose J. Lopez Audio Engineering Society convention . 2012

机译：用特定鸡尾酒党培训评估源分离方法对鲁棒自动语音识别的影响
5. Advances in Audiovisual Speech Processing for Robust Voice Activity Detection and Automatic Speech Recognition [D] . Tao, Fei. 2018

机译：用于鲁棒语音活动检测和自动语音识别的视听语音处理方面的进展
6. Diagnostic Assessment of Childhood Apraxia of Speech Using Automatic Speech Recognition (ASR) Methods [O] . John-Paul Hosom, Lawrence Shriberg, Jordan R. Green -1

机译：使用自动语音识别（ASR）方法对儿童言语失用症的诊断评估
7. Comparative Evaluation of Speech Enhancement Methods for Robust Automatic Speech Recognition [O] . Kuldip K. Paliwal, James G. Lyons, Stephen So, 2011

机译：鲁棒自动语音识别中语音增强方法的比较评估

Evaluating the Influence of Source Separation Methods in Robust Automatic Speech Recognition with a Specific Cocktail-Party Training

摘要

著录项

相似文献

相关主题

期刊订阅