首页> 外文会议>International Conference on Information Technology and Electrical Engineering >Denoising Speech for MFCC Feature Extraction Using Wavelet Transformation in Speech Recognition System
【24h】

Denoising Speech for MFCC Feature Extraction Using Wavelet Transformation in Speech Recognition System

机译:语音识别系统中基于小波变换的MFCC特征提取语音降噪

获取原文

摘要

Mel frequency cepstral coefficient (MFCC) is a popular feature extraction method for a speech recognition system. However, this method is susceptible to noise even though it generates a high accuracy. The conventional MFCC method has a degraded performance when the input signal has noises. This paper presents the implementation of denoising wavelet on speech input of MFCC feature extraction method. The addition of denoising process using wavelet transformation was expected to improve the MFCC performance on noisy signals. The study used 120 speech data, with 30 data were used as the reference, and the other 90 were used as the testing data. The testing data were mixed with white Gaussian noise and then tested to the speech recognition system that already had the reference data. Parameters used in the wavelet denoising process were soft thresholding with the Minimaxi thresholding rule. Eleven wavelet methods on decomposition level 10 were tested on the denoising process. The classification process used K-nearest neighbor (KNN) method. The Fejer-Korovkin 6 wavelet was the best denoising speech signal method that achieved the highest accuracy on input signals with SNR of 5-15dB. Meanwhile, the Daubechies 5 method had a high accuracy on input signal with SNR of 3 dB. All of the tested denoising methods using wavelet transformation were able to improve the accuracy of the speech recognition system on input signals with SNR of 0-10 dB compared to the system without denoising method.
机译:梅尔频率倒谱系数(MFCC)是一种用于语音识别系统的流行特征提取方法。但是,即使该方法产生高精度,也容易受到噪声的影响。当输入信号有噪声时,传统的MFCC方法的性能会下降。本文提出了MFCC特征提取方法在语音输入中去噪小波的实现。期望增加使用小波变换的去噪处理,以改善噪声信号下的MFCC性能。该研究使用了120个语音数据,其中30个数据用作参考,另外90个数据用作测试数据。将测试数据与高斯白噪声混合,然后测试到已经具有参考数据的语音识别系统。小波去噪过程中使用的参数是使用Minimaxi阈值规则的软阈值。在去噪过程中测试了十种分解级别为10的小波方法。分类过程使用K最近邻法(KNN)。 Fejer-Korovkin 6小波是最佳的降噪语音信号方法,在SNR为5-15dB的输入信号上实现了最高的精度。同时,Daubechies 5方法对输入信号具有3 dB的SNR的高精度。与未采用降噪方法的系统相比,使用小波变换的所有经过测试的降噪方法均能够提高语音识别系统对SNR为0-10 dB的输入信号的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号