The Speech Recognition of Double-Syllable Chinese Words Based on the Hilbert Spectrum

Tianyang Long; Long Zhang; Tingfa Xu; Shuangwei Wang

首页> 外文期刊>Journal of software >The Speech Recognition of Double-Syllable Chinese Words Based on the Hilbert Spectrum

【24h】

The Speech Recognition of Double-Syllable Chinese Words Based on the Hilbert Spectrum

机译：基于希尔伯特谱的双音节汉语单词语音识别

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Here a Chinese lexical recognition task is studied by a small vocabulary including 40 double-syllable Chinese words. In the approach presented, the Hilbert-Huang Transform (HHT) which consists of two steps is applied to speech signal analyzing. First, the speech signals are decomposed into a set of intrinsic mode functions (IMFs) by using the empirical mode decomposition (EMD) technique. Second, the first two IMFs are retained for further Hilbert spectral analysis. Final presentation of the speech signal is an energy-frequency-time distribution designated as the Hilbert spectrum, which can be used to depict the characteristics of speech sounds. For feature extraction, the Hilbert spectrum of each speech signal is divided into a set of frequency sub-bands. The number of discrete points on the Hilbert spectrum each sub-band contained is calculated as an element of the feature vector. Feature vectors obtained are fed to Support Vector Machine (SVM) classifier for classification. The proposed method is evaluated using 3840 speech samples from 8 different speakers (4 male). The experimental result, overall recognition rate of the 40 words achieving around 97% demonstrates the effectiveness of this approach.

机译：这里的汉语词汇识别任务是通过一个包含40个双音节汉语单词的小词汇来研究的。在提出的方法中，将由两步组成的希尔伯特-黄变换（HHT）应用于语音信号分析。首先，通过使用经验模式分解（EMD）技术将语音信号分解为一组固有模式函数（IMF）。其次，保留前两个IMF以进行进一步的希尔伯特频谱分析。语音信号的最终呈现是称为希尔伯特频谱的能量-频率-时间分布，可用于描述语音的特征。为了进行特征提取，将每个语音信号的希尔伯特频谱划分为一组频率子带。计算每个子带包含的希尔伯特频谱上离散点的数量，作为特征向量的元素。获得的特征向量被馈送到支持向量机（SVM）分类器进行分类。使用来自8个不同说话者（4个男性）的3840个语音样本对提出的方法进行了评估。实验结果表明，40个单词的整体识别率达到97％左右，证明了该方法的有效性。

著录项

来源
《Journal of software》 |2017年第9期|共12页
作者
Tianyang Long; Long Zhang; Tingfa Xu; Shuangwei Wang;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类计算技术、计算机技术;
关键词
Speech recognitionempirical mode decompositionhilbert-huang transformhilbert spectrum.;

机译：语音识别经验模式分解希尔伯特-黄变换希尔伯特谱;

相似文献

外文文献
中文文献
专利

1. The Speech Recognition of Double-Syllable Chinese Words Based on the Hilbert Spectrum [J] . Tianyang Long, Long Zhang, Tingfa Xu, Journal of Computers . 2017,第9期

机译：基于希尔伯特谱的双音节汉语单词语音识别
2. A comparison of recognition performances in speech-spectrum noise by listeners with normal hearing on PB-50, CID W-22, NU-6, W-1 spondaic words, and monosyllabic digits spoken by the same speaker. [J] . Wilson RH, McArdle R, Roberts H Journal of the American Academy of Audiology . 2008,第6期

机译：PB-50，CID W-22，Nu-6，W-1纯粹词语和同一扬声器中展示的语音频谱噪声识别性能的比较。
3. English Phrase Speech Recognition Based on Continuous Speech Recognition Algorithm and Word Tree Constraints [J] . Haifan Du, Haiwen Duan Complexity . 2021,第a期

机译：英语短语语音识别基于连续语音识别算法和字树约束
4. Novel Hilbert Energy Spectrum Based Features for Speech Emotion Recognition [C] . Xin Li, Xiang Li 2010 WASE International Conference on Information Engineering . 2010

机译：基于新型希尔伯特能量谱的语音情感识别功能
5. Learning Out-of-Vocabulary Words in Automatic Speech Recognition. [D] . Qin, Long. 2013

机译：在自动语音识别中学习词汇外单词。
6. Biologically-Inspired Spike-Based Automatic Speech Recognition of Isolated Digits Over a Reproducing Kernel Hilbert Space [O] . Kan Li, José C. Príncipe 2018

机译：仿生希尔伯特空间上基于数字启发的基于穗的孤立数字自动语音识别
7. Word recognition from speech signal using linear predictive coding and spectrum analysis [O] . Mandeep Singh, Gurpreet Singh 2018

机译：使用线性预测编码和频谱分析从语音信号识别语音信号

The Speech Recognition of Double-Syllable Chinese Words Based on the Hilbert Spectrum

摘要

著录项

相似文献

相关主题

期刊订阅