Denoising Speech for MFCC Feature Extraction Using Wavelet Transformation in Speech Recognition System

机译：语音识别系统中基于小波变换的MFCC特征提取语音降噪

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Mel frequency cepstral coefficient (MFCC) is a popular feature extraction method for a speech recognition system. However, this method is susceptible to noise even though it generates a high accuracy. The conventional MFCC method has a degraded performance when the input signal has noises. This paper presents the implementation of denoising wavelet on speech input of MFCC feature extraction method. The addition of denoising process using wavelet transformation was expected to improve the MFCC performance on noisy signals. The study used 120 speech data, with 30 data were used as the reference, and the other 90 were used as the testing data. The testing data were mixed with white Gaussian noise and then tested to the speech recognition system that already had the reference data. Parameters used in the wavelet denoising process were soft thresholding with the Minimaxi thresholding rule. Eleven wavelet methods on decomposition level 10 were tested on the denoising process. The classification process used K-nearest neighbor (KNN) method. The Fejer-Korovkin 6 wavelet was the best denoising speech signal method that achieved the highest accuracy on input signals with SNR of 5-15dB. Meanwhile, the Daubechies 5 method had a high accuracy on input signal with SNR of 3 dB. All of the tested denoising methods using wavelet transformation were able to improve the accuracy of the speech recognition system on input signals with SNR of 0-10 dB compared to the system without denoising method.

机译：梅尔频率倒谱系数（MFCC）是一种用于语音识别系统的流行特征提取方法。但是，即使该方法产生高精度，也容易受到噪声的影响。当输入信号有噪声时，传统的MFCC方法的性能会下降。本文提出了MFCC特征提取方法在语音输入中去噪小波的实现。期望增加使用小波变换的去噪处理，以改善噪声信号下的MFCC性能。该研究使用了120个语音数据，其中30个数据用作参考，另外90个数据用作测试数据。将测试数据与高斯白噪声混合，然后测试到已经具有参考数据的语音识别系统。小波去噪过程中使用的参数是使用Minimaxi阈值规则的软阈值。在去噪过程中测试了十种分解级别为10的小波方法。分类过程使用K最近邻法（KNN）。 Fejer-Korovkin 6小波是最佳的降噪语音信号方法，在SNR为5-15dB的输入信号上实现了最高的精度。同时，Daubechies 5方法对输入信号具有3 dB的SNR的高精度。与未采用降噪方法的系统相比，使用小波变换的所有经过测试的降噪方法均能够提高语音识别系统对SNR为0-10 dB的输入信号的准确性。

著录项

来源
《International Conference on Information Technology and Electrical Engineering》|2018年|280-284|共5页
会议地点
作者
Risanuri Hidayat; Agus Bejo; Sujoko Sumaryono; Anggun Winursito;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Speech recognition; Noise reduction; Mel frequency cepstral coefficient; Wavelet transforms; Feature extraction; Signal to noise ratio;

机译：语音识别;降噪;梅尔频率倒谱系数;小波变换;特征提取;信噪比;

相似文献

外文文献
中文文献
专利

1. Wavelet-based denoising for robust feature extraction for speech recognition [J] . Farooq O., Datta S. Electronics Letters . 2003,第1期

机译：基于小波的去噪，用于语音识别的鲁棒特征提取
2. DRGAS Sz. Speech recognition by means of feature extraction method based on slope transformation assisted with denoising [J] . DABROWSKI A. Elektronika . 2007,第4期

机译：DRGAS Sz。基于降噪辅助斜率变换的特征提取方法进行语音识别
3. Study on processing of wavelet speech denoising in speech recognition system [J] . Xinmei Zhong, Yunzhong Dai, Yong Dai, International journal of speech technology . 2018,第3期

机译：语音识别系统中小波语音去噪处理的研究
4. Denoising Speech for MFCC Feature Extraction Using Wavelet Transformation in Speech Recognition System [C] . Risanuri Hidayat, Agus Bejo, Sujoko Sumaryono, International Conference on Information Technology and Electrical Engineering . 2018

机译：语音识别系统中使用小波变换的MFCC特征提取的去噪语音
5. Wavelet-based feature extraction for robust speech recognition. [D] . Walker, Shonda Lachelle. 2003

机译：基于小波的特征提取，可实现强大的语音识别。
6. On the Speech Properties and Feature Extraction Methods in Speech Emotion Recognition [O] . Juraj Kacur, Boris Puterka, Jarmila Pavlovicova, 2021

机译：语音情感识别中的语音特性和特征提取方法
7. Discrete Wavelet Denoising into MFCC for Noise Suppressive in Automatic Speech Recognition System [O] . Hay Naing, Risanuri Hidayat, Rudy Hartanto, 2020

机译：离散小波去噪到MFCC中的自动语音识别系统中的噪声抑制

Denoising Speech for MFCC Feature Extraction Using Wavelet Transformation in Speech Recognition System

摘要

著录项

相似文献

相关主题

期刊订阅