A speech recognition IC with an efficient MFCC extraction algorithm and multi-mixture models.

Abstract

Automatic speech recognition (ASR) by machine has received a great deal of attention in past decades. Speech recognition algorithms based on Mel-frequency cepstral coefficients (MFCC) and the hidden Markov model (HMM) achieve better recognition performance than other speech recognition algorithms and are widely used in many applications. This thesis presents a speech recognition system with an efficient MFCC extraction algorithm and multi-mixture models. It is composed of two parts: an MFCC feature extractor and an HMM-based speech decoder.

In the conventional MFCC feature extraction algorithm, speech is separated into short overlapping frames. The existing extraction algorithm requires a large amount of computation and is not suitable for hardware implementation. In this work we have developed a hardware-efficient MFCC feature extraction algorithm. The new algorithm reduces the computational power by 54% compared with the conventional algorithm, with only a 1.7% reduction in recognition accuracy.

For the HMM-based decoder of the speech recognition system, it is advantageous to use models with multiple mixtures, but the calculation becomes more complicated as the number of mixtures grows. Using a table look-up method proposed in this thesis, the new design can handle up to 16 states and 8 mixtures, and it can easily be extended to handle models with more states and mixtures. We have implemented the new algorithm on an Altera FPGA chip using fixed-point calculation and tested the chip with speech data from the AURORA 2 database, a well-known database designed to evaluate the performance of speech recognition algorithms in noisy conditions [27]. The recognition accuracy of the new system is 91.01%. A conventional software recognition system running on a PC using 32-bit floating-point calculation has a recognition accuracy of 94.65%.
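As an illustration of the framing step that the abstract attributes to the conventional MFCC front end, the following is a minimal sketch of splitting a signal into short overlapping frames. It is not taken from the thesis: the function name `split_into_frames`, the 8 kHz sample rate, the 25 ms frame length, and the 10 ms hop are all assumptions chosen for the example.

```python
def split_into_frames(samples, frame_len, hop_len):
    """Split a sequence of samples into overlapping frames.

    frame_len and hop_len are in samples. Frames overlap when
    hop_len < frame_len. A trailing partial frame is dropped.
    """
    if frame_len <= 0 or hop_len <= 0:
        raise ValueError("frame_len and hop_len must be positive")
    frames = []
    start = 0
    while start + frame_len <= len(samples):
        frames.append(list(samples[start:start + frame_len]))
        start += hop_len
    return frames


# Assumed example values, not from the thesis:
SAMPLE_RATE = 8000   # samples per second
FRAME_LEN = 200      # 25 ms at 8 kHz
HOP_LEN = 80         # 10 ms at 8 kHz, so consecutive frames share 120 samples

signal = [0.0] * SAMPLE_RATE  # one second of silence as placeholder input
frames = split_into_frames(signal, FRAME_LEN, HOP_LEN)
print(len(frames), "frames of", FRAME_LEN, "samples each")
```

Each frame would then be passed through the remaining MFCC stages. Because every frame is processed separately, the per-frame cost is multiplied by the number of frames, which is consistent with the abstract's remark that the conventional algorithm is computationally heavy.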

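Bibliographic details

Author: Han, Wei
Author affiliation: The Chinese University of Hong Kong (Hong Kong)
Degree-granting institution: The Chinese University of Hong Kong (Hong Kong)
Discipline: Engineering, Electronics and Electrical
Degree: Ph.D.
Year: 2006
Pages: 255 p.
Total pages: 255
Original format: PDF
Language of text: English (eng)
Chinese Library Classification: Radio electronics, telecommunications technology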

Similar literature

1. Energy-Efficient Floating-Point MFCC Extraction Architecture for Speech Recognition Systems [J]. Jo Jihyuck, Yoo Hoyoung, Park In-Cheol. Very Large Scale Integration (VLSI) Systems, IEEE Transactions on. 2016, No. 2
2. Efficient Feature Extraction Algorithms to Develop an Arabic Speech Recognition System [J]. A. A. Alasadi, T. H. Aldhayni, R. R. Deshmukh. Engineering Technology and Applied Science Research. 2020, No. 2
3. Efficient Noise Robust Feature Extraction Algorithms for Distributed Speech Recognition (DSR) Systems [J]. Bojan Kotnik, Damjan Vlaj, Bogomir Horvat. International Journal of Speech Technology. 2003, No. 3
4. Energy-efficient MFCC extraction architecture in mixed-signal domain for automatic speech recognition [C]. Qin Li, Huifeng Zhu, Fei Qiao. IEEE/ACM International Symposium on Nanoscale Architectures. 2018
5. Novel algorithms for video text extraction with application to license plate recognition [D]. Chen, Minya. 2004
6. Efficient Adaptive Speech Reception Threshold Measurements Using Stochastic Approximation Algorithms [O]. Gertjan Dingemanse, André Goedegebure. 2019
7. MSP-MFCC: Energy-Efficient MFCC Feature Extraction Method With Mixed-Signal Processing Architecture for Wearable Speech Recognition Applications [O]. Qin Li, Yuze Yang, Tianxiang Lan. 2020
