首页> 外文会议>INTERSPEECH 2012 >Simultaneous Discriminative Training and Mixture Splitting of HMMs for Speech Recognition
【24h】

Simultaneous Discriminative Training and Mixture Splitting of HMMs for Speech Recognition

机译:语音识别综合鉴别训练和混合分裂

获取原文

摘要

A method is proposed to incorporate mixture density splitting into the acoustic model discriminative training for speech recognition. The standard method is to obtain a high resolution acoustic model by maximum likelihood training and density splitting, and then improving this model by discriminative training. We choose a log-linear form of acoustic model because for a single Gaussian density per triphone state the log-linear MMI optimization is a convex optimization problem, and by further splitting and discriminative training of this model we can get a higher complexity model. Previously it was shown that we achieve large gains in the objective function and corresponding moderate gains in the word error rate on a large vocabulary corpus. This paper incorporates the state of the art minimum phone error training criterion into the framework, and shows that after discriminative splitting, a subsequent log-linear MPE training achieves better results than Gaussian mixture model MPE optimization alone.
机译:提出了一种方法将混合密度分裂掺入语音识别的声学模型鉴别训练中。标准方法是通过最大似然训练和密度分裂获得高分辨率声学模型,然后通过鉴别培训改善该模型。我们选择声学模型的逻辑线性形式,因为对于每个三士的单个高斯密度,Log-Linear MMI优化是凸优化问题,并且通过对该模型的进一步分裂和鉴别训练,我们可以获得更高的复杂性模型。以前表明,我们在大型词汇表中实现了目标函数的巨大收益和相应的中等收益。本文采用了艺术最小手机误差训练标准的状态改变成框架,并表明,歧视性拆分后,随后的数线性MPE培训实现了比高斯混合模型MPE单独优化更好的效果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号