Noise Robust Speech Recognition Using F_0 Contour Information

Koji IWANO; Takahiro SEKI; Sadaoki FURUI

首页> 外文期刊>IEICE Transactions on Information and Systems >Noise Robust Speech Recognition Using F_0 Contour Information

【24h】

Noise Robust Speech Recognition Using F_0 Contour Information

机译：使用F_0轮廓信息的鲁棒语音识别

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper proposes a noise robust speech recognition method using prosodic information. In Japanese, the fundamental frequency (F_0) contour represents phrase intonation and word accent information. Consequently, it conveys information about prosodic phrases and word boundaries. This paper first describes a noise robust F_0 extraction method using the Hough transform, which achieves high extraction rates under various noise environments. Then it proposes a robust speech recognition method using multi-stream HMMs which model both segmental spectral and F_0 contour information. Speaker-independent experiments are conducted using connected digits uttered by 11 male speakers in various kinds of noise and SNR conditions. The recognition error rate is reduced in all noise conditions, and the best absolute improvement of digit accuracy is about 4.5%. This improvement is achieved by robust digit boundary detection using the prosodic information.

机译：本文提出了一种基于韵律信息的鲁棒语音识别方法。在日语中，基本频率（F_0）轮廓表示短语语调和单词重音信息。因此，它传达了有关韵律短语和单词边界的信息。本文首先介绍了使用霍夫变换的鲁棒F_0噪声提取方法，该方法在各种噪声环境下均能实现较高的提取率。然后，提出了一种使用多流HMM的鲁棒语音识别方法，该方法同时对分段频谱和F_0轮廓信息进行建模。独立扬声器的实验是使用11位男性扬声器在各种噪声和SNR条件下发出的相连数字进行的。在所有噪声条件下，识别错误率都会降低，并且数字精度的最佳绝对提高约为4.5％。通过使用韵律信息进行鲁棒的数字边界检测可以实现此改进。

著录项

来源
《IEICE Transactions on Information and Systems》 |2004年第5期|p.1102-1109|共8页
作者
Koji IWANO; Takahiro SEKI; Sadaoki FURUI;
展开▼
作者单位

Department of Computer Science, Tokyo Institute of Technology, Tokyo, 152-8552 Japan;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类无线电电子学、电信技术;
关键词
noise robust speech recognition; prosody; fundamental frequency (F_0) contour; multi-stream HMM; hough transform;

机译：噪声鲁棒语音识别;韵律;基频（F_0）轮廓;多流HMM;霍夫变换;

相似文献

外文文献
中文文献
专利

1. A Low-Complexity Parabolic Lip Contour Model With Speaker Normalization for High-Level Feature Extraction in Noise-Robust Audiovisual Speech Recognition [J] . Borgstrom B.J., Alwan A. IEEE transactions on systems, man, and cybernetics. Part A, Systems and humans . 2008,第6期

机译：具有说话人归一化功能的低复杂度抛物线形嘴唇轮廓模型，用于噪声鲁棒的视听语音识别中的高级特征提取
2. Noise robust speech recognition by integration of MLLR adaptation and feature extraction for noise reduced speech [J] . Masakiyo Fujimoto, Yasuo Ariki 電子情報通信学会技術研究報告. 音声. Speech . 2001,第522期

机译：通过集成MLLR自适应和特征提取以降低噪声的语音，增强了噪声鲁棒性
3. Noise robust speech recognition by integration of MLLR adaptation and feature extraction for noise reduced speech [J] . Masakiyo Fujimoto, Yasuo Ariki 電子情報通信学会技術研究報告. 言語理解とコミュニケーション. Natural Language Understanding and Models of Communication . 2001,第520期

机译：噪声稳健性语音识别通过集成MLLR自适应和特征提取来减少语音
4. F_0 Contour Analysis Based on Empirical Mode Decomposition for DNN Acoustic Modeling in Mandarin Speech Recognition [C] . Xiaoyun Wang, Xugang Lu, Hisashi Kawai, Annual Conference of the International Speech Communication Association . 2016

机译：基于DNN声学建模在普通话语音识别的实证分解的F_0轮廓分析
5. Compressive nonlinearity for representing speech spectral magnitude to improve noise robustness of automatic speech recognition . [D] . Wong, Brian. 2011

机译：压缩非线性表示语音频谱幅度提高语音自动识别的鲁棒性。
6. Incorporating Noise Robustness in Speech Command Recognition by Noise Augmentation of Training Data [O] . Ayesha Pervaiz, Fawad Hussain, Huma Israr, 2020

机译：通过训练数据的噪声增强将噪声鲁棒性纳入语音命令识别中
7. Noise Robust Automatic Speech Recognition with Adaptive Quantile Based Noise Estimation and Speech Band Emphasizing Filter Bank [O] . Casper Stork Bonde, Carina Graversen, Andreas Gregers Gregersen, 2008

机译：基于自适应分位数的噪声估计和语音带增强滤波器组的鲁棒自动语音识别

Noise Robust Speech Recognition Using F_0 Contour Information

摘要

著录项

相似文献

相关主题

期刊订阅