首页> 美国政府科技报告 >Rich System Combination For Keyword Spotting In Noisy and Acoustically Heterogeneous Audio Streams.
【24h】

Rich System Combination For Keyword Spotting In Noisy and Acoustically Heterogeneous Audio Streams.

机译:用于噪声和声学异构音频流中关键字定位的丰富系统组合。

获取原文

摘要

We address the problem of retrieving spoken information from noisy and heterogeneous audio archives using a rich system combination with a diverse set of noise-robust modules and audio characteriza- tion. Audio search applications so far have focused on constrained domains or genres and not-so-noisy and heterogeneous acoustic or channel conditions. In this paper, our focus is to improve the ac- curacy of a keyword spotting spotting system in a highly degraded and diverse channel conditions by employing multiple recognition systems in parallel with different robust frontends and modeling choices, as well as different representations during audio indexing and search (words vs. subword units). Then, after aligning keyword hits from different systems, we employ system combination at the score level using a logistic-regression-based classifier. When avail- able, side information (such as signal-to-noise ratio or the output of an acoustic condition identification module) is used to guide sys- tem combination that is trained on separate held-out data. Lattice- based indexing and search is used in all keyword spotting systems. We present improvements in probability-miss at a fixed probability- false-alarm by employing our proposed rich system combination approach on DARPA Robust Audio Transcription (RATS) Phase- I evaluation data that contains highly degraded channel recordings (SNR as low as 0 dB) and different channel characteristics.

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号