Dynamic Selection of Feature Spaces for robust Speech Recognition

Abstract

Selection of acoustic features for robust speech recognition has been a subject of research for several years. Algorithms that use feature vectors from multiple frequency bands [9], or that switch between multiple feature streams [10], have been reported in the literature as ways to handle robustness under different acoustic conditions. Acoustic models built from different feature sets produce different kinds of recognition errors. In this paper, we propose a likelihood-based scheme to combine the acoustic feature vectors from multiple signal processing schemes within the decoding framework, in order to extract maximum benefit from these different acoustic feature vectors and models. The proposed technique is general enough to be applied to other pattern recognition fields, such as OCR and handwriting recognition. The fundamental idea behind this approach is to pick the set of features that classifies a frame of speech accurately, with no a priori information about the phonetic class or acoustic channel that the speech comes from. Two methods of merging any set of acoustic features, such as formant-based features, cepstral feature vectors, PLP features, and LDA features, are presented.
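
The abstract does not spell out the two merging methods, so the following is only a minimal sketch of what frame-level, likelihood-based selection and merging of feature streams could look like, not the authors' algorithm. It assumes one diagonal-covariance Gaussian per phonetic class in each feature space, uniform class priors, and per-stream normalization of the likelihoods so that streams of different dimensionality can be compared. All names (`class_posteriors`, `select_stream`, `merge_streams`), the toy models, and the stream weights are hypothetical.

```python
import math


def log_gaussian(x, mean, var):
    """Log density of a diagonal-covariance Gaussian evaluated at x."""
    return sum(
        -0.5 * (math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v)
        for xi, m, v in zip(x, mean, var)
    )


def class_posteriors(frame, models):
    """Normalize the class likelihoods of one stream (uniform priors assumed).

    models maps a class label to a (mean, variance) pair in that stream's
    feature space. Normalizing inside each stream puts scores from feature
    spaces of different dimensionality on a common 0..1 scale.
    """
    logs = {label: log_gaussian(frame, m, v) for label, (m, v) in models.items()}
    peak = max(logs.values())  # subtract the peak for numerical stability
    unnorm = {label: math.exp(s - peak) for label, s in logs.items()}
    total = sum(unnorm.values())
    return {label: u / total for label, u in unnorm.items()}


def select_stream(frames_by_stream, models_by_stream):
    """Pick, for one frame, the stream whose top class is most confident.

    No phonetic class or channel label is used: the choice relies only on
    the normalized likelihoods. Returns (stream, label, confidence).
    """
    best = None
    for stream, frame in frames_by_stream.items():
        post = class_posteriors(frame, models_by_stream[stream])
        label = max(post, key=post.get)
        if best is None or post[label] > best[2]:
            best = (stream, label, post[label])
    return best


def merge_streams(frames_by_stream, models_by_stream, weights):
    """Alternative: weighted sum of the per-stream normalized likelihoods."""
    merged = {}
    for stream, frame in frames_by_stream.items():
        post = class_posteriors(frame, models_by_stream[stream])
        for label, p in post.items():
            merged[label] = merged.get(label, 0.0) + weights[stream] * p
    return merged


# Toy example: a 2-dimensional "cepstral" stream and a 1-dimensional "plp" stream.
models_by_stream = {
    "cepstral": {"a": ([0.0, 0.0], [1.0, 1.0]), "b": ([0.5, 0.5], [1.0, 1.0])},
    "plp": {"a": ([0.0], [1.0]), "b": ([4.0], [1.0])},
}
frames_by_stream = {"cepstral": [0.2, 0.2], "plp": [3.8]}

print(select_stream(frames_by_stream, models_by_stream))
print(merge_streams(frames_by_stream, models_by_stream, {"cepstral": 0.5, "plp": 0.5}))
```

In this toy frame the cepstral models barely separate the two classes while the PLP models separate them clearly, so the selection picks the PLP stream. A real decoder would apply such scores inside the search over HMM states rather than to isolated frames.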

Bibliographic record

Source
International Conference on Spoken Language Processing | 2000 | 4 pages
Authors
Bhuvana Ramabhadran; Yuqing Gao; Michael Picheny
Original format: PDF
Chinese Library Classification: G18

Similar literature

1. Dynamic feature variance adaptation for robust speech recognition with a speech enhancement pre-processor [J]. Marc DELCROIX, Tomohiro NAKATANI, Shinji WATANABE. 電子情報通信学会技術研究報告. 音声 (Speech). 2007, No. 406.
2. Dynamic feature variance adaptation for robust speech recognition with a speech enhancement pre-processor [J]. Marc DELCROIX, Tomohiro NAKATANI, Shinji WATANABE. 電子情報通信学会技術研究報告. 言語理解とコミュニケーション (Natural Language Understanding and Models of Communication). 2007, No. 405.
3. Unsupervised feature selection and NMF de-noising for robust Speech Emotion Recognition [J]. Bandela Surekha Reddy, Kumar T. Kishore. Applied Acoustics. 2021, Jan. issue.
4. Dynamic Selection of Feature Spaces for robust Speech Recognition [C]. Bhuvana Ramabhadran, Yuqing Gao, Michael Picheny. 6th International Conference on Spoken Language Processing (ICSLP 2000), Oct. 16-20, 2000, Beijing International Convention Center, Beijing, China. 2000.
5. Robust speech processing based on microphone array, audio-visual, and frame selection for in-vehicle speech recognition and in-set speaker recognition [D]. Zhang, Xianxian. 2005.
6. New Features Using Robust MVDR Spectrum of Filtered Autocorrelation Sequence for Robust Speech Recognition [O]. Sanaz Seyedin, Seyed Mohammad Ahadi, Saeed Gazor. 2013.
7. Non-Linear Transformations Of The Feature Space For Robust Speech Recognition [O]. Angel de la Torre, José C. Segura, Carmen Benítez. 2002.
