首页> 外文学位 >Mispronunciation detection for language learning and speech recognition adaptation.
【24h】

Mispronunciation detection for language learning and speech recognition adaptation.

机译:错误识别检测用于语言学习和语音识别适应。

获取原文
获取原文并翻译 | 示例

摘要

The areas of "mispronunciation detection" (or "accent detection" more specifically) within the speech recognition community are receiving increased attention now. Two application areas, namely language learning and speech recognition adaptation, are largely driving this research interest and are the focal points of this work.;There are a number of Computer Aided Language Learning (CALL) systems with Computer Aided Pronunciation Training (CAPT) techniques that have been developed. In this thesis, a new HMM-based text-dependent mispronunciation system is introduced using text Adaptive Frequency Cepstral Coefficients (AFCCs). It is shown that this system outperforms the conventional HMM method based on Mel Frequency Cepstral Coefficients (MFCCs). In addition, a mispronunciation detection and classification algorithm based on Principle Component Analysis (PCA) is introduced to help language learners identify and correct their pronunciation errors at the word and syllable levels.;To improve speech recognition by adaptation, two projects have been explored. The first one improves name recognition by learning acceptable variations in name pronunciations, as one of the approaches to make grammar-based name recognition adaptive. The second project is accent detection by examining the shifting of fundamental vowels in accented speech. This approach uses both acoustic and phonetic information to detect accents and is shown to be beneficial with accented English. These applications can be integrated into an automated international calling system, to improve recognition of callers' names and speech. It determines the callers' accent based in a short period of speech. Once the type of accents is detected, it switches from the standard speech recognition engine to an accent-adaptive one for better recognition results.
机译:语音识别社区中的“错误发音检测”(或更具体地讲是“重音检测”)领域现在正受到越来越多的关注。语言学习和语音识别适应这两个应用领域在很大程度上推动了这项研究的兴趣,并且是这项工作的重点。;有许多采用计算机辅助语音训练(CAPT)技术的计算机辅助语言学习(CALL)系统已经开发出来的。本文采用文本自适应频率倒谱系数(AFCC),介绍了一种新的基于HMM的基于文本的错误发音系统。结果表明,该系统优于基于梅尔频率倒谱系数(MFCC)的传统HMM方法。此外,引入了一种基于主成分分析(PCA)的错误发音检测和分类算法,以帮助语言学习者在单词和音节级别上识别和纠正其发音错误。为了改进通过适应的语音识别,探索了两个项目。第一种方法是通过学习可接受的名称发音变化来改善名称识别,这是使基于语法的名称识别自适应的方法之一。第二个项目是通过检查语音中基本元音的移动来检测语音。这种方法同时使用声音和语音信息来检测重音,并显示出对重音英语的好处。这些应用程序可以集成到自动国际呼叫系统中,以改善对呼叫者姓名和语音的识别。它可以根据简短的语音来确定呼叫者的口音。一旦检测到重音类型,它就会从标准语音识别引擎切换到支持重音的引擎,以获得更好的识别结果。

著录项

  • 作者

    Ge, Zhenhao.;

  • 作者单位

    Purdue University.;

  • 授予单位 Purdue University.;
  • 学科 Engineering Electronics and Electrical.;Information Science.;Computer Science.
  • 学位 Ph.D.
  • 年度 2013
  • 页码 147 p.
  • 总页数 147
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号