Evaluating the Influence of Source Separation Methods in Robust Automatic Speech Recognition with a Specific Cocktail-Party Training

机译：用特定鸡尾酒党培训评估源分离方法对鲁棒自动语音识别的影响

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Automatic Speech Recognition (ASR) allows a computer to identify the words that a person speaks into a microphone and convert it to written text. One of the most challenging situations for ASR is the cocktail-party environment. Although source separation methods have already been investigated to deal with this problem, the separation process is not perfect and the resulting artifacts pose an additional problem to ASR performance in case of using separation methods based on time-frequency masks. Recently, the authors proposed a specific training method to deal with simultaneous speech situations in practical ASR systems. In this paper, we study how the speech recognition performance is affected by selecting different combinations of separation algorithms both at the training and test stages of the ASR system under different acoustic conditions. The results show that, while different separation methods produce different types of artifacts, the overall performance of the method is always increased when using any cocktail-party training.

机译：自动语音识别（ASR）允许计算机识别人们在麦克风中发言并将其转换为书面文本的单词。 ASR中最具挑战性的情况之一是鸡尾酒会的环境。尽管已经研究了源分离方法来处理这个问题，但分离过程并不完美，并且所得到的工件在使用基于时频掩模的分离方法的情况下对ASR性能提出了额外的问题。最近，作者提出了一种特定的培训方法来处理实际ASR系统中的同时语音情况。在本文中，我们研究了通过在不同的声学条件下选择ASR系统的训练和测试阶段的分离算法的不同组合来研究语音识别性能的影响。结果表明，虽然不同的分离方法产生不同类型的伪影，但在使用任何鸡尾酒党培训时，该方法的整体性能总是增加。

著录项

来源
《Audio Engineering Society Convention;Audio Engineering Society.》|2012年||共8页
会议地点
作者
Amparo Marti; Maximo Cobos; Jose J. Lopez;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 73.45083;
关键词
Evaluating; Source Separation Methods; Specific Cocktail-Party;

机译：评估;源分离方法;特定的鸡尾酒派对;

相似文献

外文文献
中文文献
专利

1. Automatic speech recognition in cocktail-party situations: A specific training for separated speech [J] . Marti A., Cobos M., Lopez J.J. The Journal of the Acoustical Society of America . 2012,第2aPta1期

机译：鸡尾酒会情况下的自动语音识别：针对单独语音的专门培训
2. Perceptual evaluation of blind source separation for robust speech recognition [J] . Leandro Di Persia, Diego Milone, Hugo Leonardo Rufiner, Signal processing . 2008,第10期

机译：盲源分离的感知评估可增强语音识别能力
3. Speech enhancement for robust automatic speech recognition: Evaluation using a baseline system and instrumental measures [J] . Moore Alastair H., Peso Parada Pablo, Naylor Patrick A. Computer speech and language . 2017,第nova期

机译：语音增强功能可实现强大的自动语音识别：使用基准系统和仪器测量进行评估
4. Evaluating the Influence of Source Separation Methods in Robust Automatic Speech Recognition with a Specific Cocktail-Party Training [C] . Amparo Marti, Maximo Cobos, Jose J. Lopez Audio Engineering Society convention . 2012

机译：评估源分离方法在具有特定鸡尾酒会训练的鲁棒自动语音识别中的影响
5. Advances in Audiovisual Speech Processing for Robust Voice Activity Detection and Automatic Speech Recognition [D] . Tao, Fei. 2018

机译：用于鲁棒语音活动检测和自动语音识别的视听语音处理方面的进展
6. Diagnostic Assessment of Childhood Apraxia of Speech Using Automatic Speech Recognition (ASR) Methods [O] . John-Paul Hosom, Lawrence Shriberg, Jordan R. Green -1

机译：使用自动语音识别（ASR）方法对儿童言语失用症的诊断评估
7. Comparative Evaluation of Speech Enhancement Methods for Robust Automatic Speech Recognition [O] . Kuldip K. Paliwal, James G. Lyons, Stephen So, 2011

机译：鲁棒自动语音识别中语音增强方法的比较评估

Evaluating the Influence of Source Separation Methods in Robust Automatic Speech Recognition with a Specific Cocktail-Party Training

摘要

著录项

相似文献

相关主题

期刊订阅