首页> 外文期刊>Journal of software >The Speech Recognition of Double-Syllable Chinese Words Based on the Hilbert Spectrum
【24h】

The Speech Recognition of Double-Syllable Chinese Words Based on the Hilbert Spectrum

机译:基于希尔伯特谱的双音节汉语单词语音识别

获取原文
           

摘要

Here a Chinese lexical recognition task is studied by a small vocabulary including 40 double-syllable Chinese words. In the approach presented, the Hilbert-Huang Transform (HHT) which consists of two steps is applied to speech signal analyzing. First, the speech signals are decomposed into a set of intrinsic mode functions (IMFs) by using the empirical mode decomposition (EMD) technique. Second, the first two IMFs are retained for further Hilbert spectral analysis. Final presentation of the speech signal is an energy-frequency-time distribution designated as the Hilbert spectrum, which can be used to depict the characteristics of speech sounds. For feature extraction, the Hilbert spectrum of each speech signal is divided into a set of frequency sub-bands. The number of discrete points on the Hilbert spectrum each sub-band contained is calculated as an element of the feature vector. Feature vectors obtained are fed to Support Vector Machine (SVM) classifier for classification. The proposed method is evaluated using 3840 speech samples from 8 different speakers (4 male). The experimental result, overall recognition rate of the 40 words achieving around 97% demonstrates the effectiveness of this approach.
机译:这里的汉语词汇识别任务是通过一个包含40个双音节汉语单词的小词汇来研究的。在提出的方法中,将由两步组成的希尔伯特-黄变换(HHT)应用于语音信号分析。首先,通过使用经验模式分解(EMD)技术将语音信号分解为一组固有模式函数(IMF)。其次,保留前两个IMF以进行进一步的希尔伯特频谱分析。语音信号的最终呈现是称为希尔伯特频谱的能量-频率-时间分布,可用于描述语音的特征。为了进行特征提取,将每个语音信号的希尔伯特频谱划分为一组频率子带。计算每个子带包含的希尔伯特频谱上离散点的数量,作为特征向量的元素。获得的特征向量被馈送到支持向量机(SVM)分类器进行分类。使用来自8个不同说话者(4个男性)的3840个语音样本对提出的方法进行了评估。实验结果表明,40个单词的整体识别率达到97%左右,证明了该方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号