首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >The blame game in meeting room ASR: An analysis of feature versus model errors in noisy and mismatched conditions
【24h】

The blame game in meeting room ASR: An analysis of feature versus model errors in noisy and mismatched conditions

机译:会议室ASR中的非理性游戏:在嘈杂和不匹配条件下分析功能与模型错误

获取原文

摘要

Given a test waveform, state-of-the-art ASR systems extract a sequence of MFCC features and decode them with a set of trained HMMs. When this test data is clean, and it matches the condition used for training the models, then there are few errors. While it is known that ASR systems are brittle in noisy or mismatched conditions, there has been little work in quantitatively attributing the errors to features or to models. This paper attributes the sources of these errors in three conditions: (a) matched near-field, (b) matched far-field, and a (c) mismatched condition. We undertake a series of diagnostic analyses employing the bootstrap method to probe a meeting room ASR system. Results show that when the conditions are matched (even if they are far-field), the model errors dominate; however, in mismatched conditions features are neither invariant nor separable and this causes as many errors as the model does.
机译:给定一个测试波形,最新的ASR系统将提取一系列MFCC功能并使用一组训练有素的HMM对其进行解码。当该测试数据是干净的,并且符合训练模型所使用的条件时,则几乎没有错误。尽管已知ASR系统在嘈杂或不匹配的条件下会很脆弱,但是在将误差归因于特征或模型方面几乎没有任何工作。本文将这些错误的原因归结为三种情况:(a)匹配的近场,(b)匹配的远场和(c)不匹配的情况。我们使用自举方法进行一系列诊断分析,以探查会议室ASR系统。结果表明,当条件匹配时(即使它们是远场条件),模型误差仍占主导地位;但是,在不匹配的条件下,特征既不是不变的也不是可分离的,这会导致与模型一样多的错误。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号