首页> 美国政府科技报告 >Rich System Combination For Keyword Spotting In Noisy and Acoustically Heterogeneous Audio Streams.

【24h】

Rich System Combination For Keyword Spotting In Noisy and Acoustically Heterogeneous Audio Streams.

机译：用于噪声和声学异构音频流中关键字定位的丰富系统组合。

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We address the problem of retrieving spoken information from noisy and heterogeneous audio archives using a rich system combination with a diverse set of noise-robust modules and audio characteriza- tion. Audio search applications so far have focused on constrained domains or genres and not-so-noisy and heterogeneous acoustic or channel conditions. In this paper, our focus is to improve the ac- curacy of a keyword spotting spotting system in a highly degraded and diverse channel conditions by employing multiple recognition systems in parallel with different robust frontends and modeling choices, as well as different representations during audio indexing and search (words vs. subword units). Then, after aligning keyword hits from different systems, we employ system combination at the score level using a logistic-regression-based classifier. When avail- able, side information (such as signal-to-noise ratio or the output of an acoustic condition identification module) is used to guide sys- tem combination that is trained on separate held-out data. Lattice- based indexing and search is used in all keyword spotting systems. We present improvements in probability-miss at a fixed probability- false-alarm by employing our proposed rich system combination approach on DARPA Robust Audio Transcription (RATS) Phase- I evaluation data that contains highly degraded channel recordings (SNR as low as 0 dB) and different channel characteristics.

著录项

作者
Akbacak, M.; Burget, L.; Wang, W.; van Hout, J.;
展开▼
作者单位

展开▼
年度 2013
页码 1-5
总页数 5
原文格式 PDF
正文语种 eng
中图分类工业技术;
关键词
Noise; Phonemes; Speech; Acoustic equipment; Acoustics; Adaptation; Distribution; Mean; Models; Strategy; Symposia; Training; Speech adaptation; Acoustic modeling; Phoneme analysis;

机译：噪音;音素;语音;声学设备;声学;适应;分布;均值;模型;策略;专题讨论会;训练;语音适应;声学建模;音素分析;

相似文献

外文文献
中文文献
专利

1. Audio-visual keyword spotting for access technology in children with cerebral palsy and speech impairment [J] . Orlandi Silvia, Huang Jiaqui, McGillivray Josh, Assistive technology: the official journal of RESNA . 2019,第5期

机译：脑瘫和语音障碍儿童接入技术的视听关键字发现
2. A Novel Lip Descriptor for Audio-Visual Keyword Spotting Based on Adaptive Decision Fusion [J] . Wu Pingping, Liu Hong, Li Xiaofei, Multimedia, IEEE Transactions on . 2016,第3期

机译：基于自适应决策融合的新型视听关键词识别口语描述符
3. QueryGen: Semantic interpretation of keyword queries over heterogeneous information systems [J] . Bobed Carlos, Mena Eduardo Information Sciences: An International Journal . 2016,第Null期

机译：QueryGen：异构信息系统上关键字查询的语义解释
4. Rich system combination for keyword spotting in noisy and acoustically heterogeneous audio streams [C] . Akbacak Murat, Burget Lukas, Wang Wen, IEEE International Conference on Acoustics, Speech and Signal Processing . 2013

机译：丰富的系统组合，可在嘈杂的和听觉上异质的音频流中发现关键字
5. Design of Keyword Spotting System Based on Segmental Time Warping of Quantized Features. [D] . Karmacharya, Piush. 2012

机译：基于量化特征分段时间规整的关键词识别系统设计。
6. Systematic Production of Inactivating and Non-Inactivating Suppressor Mutations at the relA Locus That Compensate the Detrimental Effects of Complete spoT Loss and Affect Glycogen Content in Escherichia coli [O] . Manuel Montero, Mehdi Rahimpour, Alejandro M. Viale, -1

机译：在relA位点系统产生失活和非失活的抑制子突变该突变可补偿完全spoT丢失的有害影响和影响大肠杆菌中糖原含量
7. RICH SYSTEM COMBINATION FOR KEYWORD SPOTTING IN NOISY AND ACOUSTICALLY HETEROGENEOUS AUDIO STREAMS [O] . Murat Akbacak, Lukas Burget, Wen Wang, 2013

机译：关于在噪声和声学异质音频流中关键词的富集系统组合

Rich System Combination For Keyword Spotting In Noisy and Acoustically Heterogeneous Audio Streams.

摘要

著录项

相似文献

相关主题

期刊订阅