...
首页> 外文期刊>Procedia Computer Science >The Wavelet and Fourier Transforms in Feature Extraction for Text-Dependent, Filterbank-Based Speaker Recognition
【24h】

The Wavelet and Fourier Transforms in Feature Extraction for Text-Dependent, Filterbank-Based Speaker Recognition

机译:特征提取中的小波和傅立叶变换,用于基于文本的,基于滤波器组的说话人识别

获取原文
           

摘要

An important step in speaker recognition is extracting features from raw speech that captures the unique characteristics of each speaker. The most widely used method of obtaining these features is the filterbank-based Mel Frequency Cepstral Coefficients (MFCC) approach. Typically, an important step in the process is the employment of the discrete Fourier transform (DFT) to compute the spectrum of the speech waveform. However, over the past few years, the discrete wavelet transform (DWT) has gained remarkable attention, and has been favored over the DFT in a wide variety of applications. This work compares the performance of the DFT with the DWT in the computation of MFCC in the feature extraction process for speaker recognition. It is shown that the DWT results in significantly lower order for the Gaussian Mixture Model (GMM) used to model speech and marginal improvement in accuracy.
机译:说话者识别的重要步骤是从原始语音中提取特征,以捕获每个说话者的独特特征。获得这些特征的最广泛使用的方法是基于滤波器组的梅尔频率倒谱系数(MFCC)方法。通常,该过程中的一个重要步骤是采用离散傅立叶变换(DFT)来计算语音波形的频谱。但是,在过去的几年中,离散小波变换(DWT)引起了人们的极大关注,并且在各种应用中都比DFT更受青睐。这项工作在说话人识别的特征提取过程中,将DFT与DWT在MFCC计算中的性能进行了比较。结果表明,DWT导致用于建模语音的高斯混合模型(GMM)的顺序明显降低,并且边缘精度有所提高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号