首页> 美国卫生研究院文献>other >LANDMARK-BASED SPEECH RECOGNITION: REPORT OF THE 2004 JOHNS HOPKINS SUMMER WORKSHOP
【2h】

LANDMARK-BASED SPEECH RECOGNITION: REPORT OF THE 2004 JOHNS HOPKINS SUMMER WORKSHOP

机译:基于地标的语音识别:2004年JOHNS HOPKINS夏季研讨会的报告

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Three research prototype speech recognition systems are described, all of which use recently developed methods from artificial intelligence (specifically support vector machines, dynamic Bayesian networks, and maximum entropy classification) in order to implement, in the form of an automatic speech recognizer, current theories of human speech perception and phonology (specifically landmark-based speech perception, nonlinear phonology, and articulatory phonology). All three systems begin with a high-dimensional multiframe acoustic-to-distinctive feature transformation, implemented using support vector machines trained to detect and classify acoustic phonetic landmarks. Distinctive feature probabilities estimated by the support vector machines are then integrated using one of three pronunciation models: a dynamic programming algorithm that assumes canonical pronunciation of each word, a dynamic Bayesian network implementation of articulatory phonology, or a discriminative pronunciation model trained using the methods of maximum entropy classification. Log probability scores computed by these models are then combined, using log-linear combination, with other word scores available in the lattice output of a first-pass recognizer, and the resulting combination score is used to compute a second-pass speech recognition output.
机译:描述了三个研究原型语音识别系统,所有这些系统都使用了来自人工智能的最新开发方法(特别是支持向量机,动态贝叶斯网络和最大熵分类),以便以自动语音识别器的形式实现当前的理论人类语音感知和语音学(特别是基于地标的语音感知,非线性语音学和发音语音学)。所有这三个系统均始于高维多帧声学到独特的特征转换,该转换是使用支持向量机实现的,该向量机经过训练以检测和分类声学语音界标。然后,使用以下三种发音模型之一对由支持向量机估计的独特特征概率进行集成:采用动态编程算法(假设每个单词的正读发音),动态贝叶斯网络的发音语音学实现方式或采用以下方法训练的判别发音模型:最大熵分类。然后,使用对数线性组合,将这些模型计算出的对数概率分数与在首过识别器的晶格输出中可用的其他单词分数相组合,然后将所得组合分数用于计算第二遍语音识别输出。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号