Noisy speech recognition using de-noised multiresolution analysis acoustic features

C. P. Chan; P. C. Ching; Tan Lee

首页> 外文期刊>The Journal of the Acoustical Society of America >Noisy speech recognition using de-noised multiresolution analysis acoustic features

【24h】

Noisy speech recognition using de-noised multiresolution analysis acoustic features

机译：使用降噪多分辨率分析声学特征的嘈杂语音识别

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper describes a novel application of multiresolution analysis (MRA) in extracting acoustic features that possess de-noising capability for robust speech recognition. The MRA algorithm is used to construct a mel-scaled wavelet packet filter-bank, from which subband powers are computed as the feature parameters for speech recognition. Wiener filtering is applied to a few selected subbands at some intermediate stages of decomposition. For high-frequency bands, Wiener filters are designed based on a reduced fraction of the estimated noise power, making the consonant features such more prominent and contrastive. The proposed method is evaluated in phone recognition experiments with the TIMIT database. In the presence of stationary white noise at 10-dB SNR, the de-noised MRA features attain a phone recognition rate of 32%. There is a noticeable improvement compared with the accuracy of 29% and 20% attained by the commonly used mel-frequency cepstral coefficients (MFCC) with and without cepstral mean normalization (CMN), respectively. The effectiveness of the MRA features is also verified by the fact that they exhibit smaller distortion from clean speech.

机译：本文介绍了一种多分辨率分析（MRA）在提取具有降噪功能以进行鲁棒语音识别的声学特征方面的新应用。 MRA算法用于构建梅尔级小波包滤波器组，从中计算子带功率作为语音识别的特征参数。在分解的某些中间阶段，将维纳滤波应用于几个选定的子带。对于高频段，维纳滤波器的设计是基于降低的估计噪声功率的一部分，从而使辅音特征更加突出和鲜明。该方法在TIMIT数据库的电话识别实验中得到了评估。在SNR为10dB的平稳白噪声存在下，经过降噪的MRA功能可实现32％的电话识别率。与常用的带有倒谱平均归一化（CMN）的梅尔频率倒谱系数（MFCC）分别达到29％和20％的精度相比，有明显的改进。 MRA功能的有效性也得到了验证，因为它们显示的语音清晰程度较小。

著录项

来源
《The Journal of the Acoustical Society of America》 |2001年第1期|共8页
作者
C. P. Chan; P. C. Ching; Tan Lee;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类声学;
关键词

相似文献

外文文献
中文文献
专利

1. Noisy speech recognition using de-noised multiresolution analysis acoustic features [J] . C. P. Chan, P. C. Ching, Tan Lee The Journal of the Acoustical Society of America . 2001,第5aPta1期

机译：使用降噪多分辨率分析声学特征的嘈杂语音识别
2. A STATISTICAL ANALYSIS ON THE IMPACT OF SPEECH ENHANCEMENT TECHNIQUES ON THE FEATURE VECTORS OF NOISY SPEECH SIGNALS FOR SPEECH RECOGNITION [J] . SWAPNANIL GOGOI, UTPAL BHATTACHARJEE Journal of computer science engineering and information technology research . 2016,第3期

机译：语音增强技术对语音识别中嘈杂语音信号特征向量影响的统计分析
3. Phoneme class based feature adaptation for mismatch acoustic modeling and recognition of distant noisy speech [J] . Uluskan Seçkin, Sangwan Abhijeet, Hansen John H.John International journal of speech technology . 2017,第4期

机译：基于音素类的特征自适应，用于失配声学建模和识别远处的嘈杂语音
4. Acoustic feature extraction using ERB like wavelet sub-band perceptual Wiener filtering for noisy speech recognition [C] . Biswas A., Sahu P.K., Bhowmick A., Annual IEEE India Conference . 2014

机译：使用ERB像小波子带感知维纳滤波的声学特征提取用于噪声语音识别
5. Acoustic modeling and feature selection for speech recognition. [D] . Zheng, Yanli. 2005

机译：用于语音识别的声学建模和特征选择。
6. Time-Frequency Feature Representation Using Multi-Resolution Texture Analysis and Acoustic Activity Detector for Real-Life Speech Emotion Recognition [O] . Kun-Ching Wang 2015

机译：使用多分辨率纹理分析和声活动检测器的时频特征表示用于现实生活中的语音情感识别
7. Analysis and prediction of acoustic speech features from mel-frequency cepstral coefficients in distributed speech recognition architectures [O] . Darch, Jonathan, Milner, Ben, Vaseghi, Saeed 2008

机译：分布式语音识别架构中基于mel频率倒谱系数的语音特征分析和预测

Noisy speech recognition using de-noised multiresolution analysis acoustic features

摘要

著录项

相似文献

相关主题

期刊订阅