Conference: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Combining eigenvoice speaker modeling and VTS-based environment compensation for robust speech recognition



Abstract

Eigenvoice modeling and vector Taylor series (VTS) are effective models of speaker differences and environmental variation, respectively. In real-world speech, however, speaker and environmental variation always coexist. In this paper, we propose combining eigenvoice and VTS. Specifically, we introduce eigenvoice speaker modeling of the clean speech into the VTS nonlinear mismatch function. In contrast, standard VTS represents the clean speech with a speaker-independent model, regardless of speaker differences. In the new approach, the eigenvoice coefficients and the noise model parameters are estimated jointly. Experimental results on the Aurora2 task show that combining eigenvoice and VTS improves performance and demonstrate the method's ability to factorize speaker and noise effects.
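The abstract does not give the model equations, so the following is only a minimal sketch of the idea under stated assumptions. It assumes the commonly used log-spectral form of the mismatch function, y = x + h + log(1 + exp(n - x - h)), and the usual eigenvoice form for the speaker-adapted clean-speech mean (a mean voice plus a weighted sum of eigenvoices). All names and numeric values (`mean_voice`, `eigenvoices`, `weights`, `noise_mean`, `channel`) are hypothetical, and the joint estimation of the eigenvoice coefficients and noise parameters described in the abstract is not shown.

```python
import numpy as np

def eigenvoice_mean(mean_voice, eigenvoices, weights):
    """Speaker-adapted clean-speech mean: mean voice plus weighted eigenvoices.

    Assumed form (not taken from the abstract): mu_x = mu_0 + E @ w.
    """
    return mean_voice + eigenvoices @ weights

def vts_mismatch(x, n, h):
    """Assumed log-spectral mismatch function.

    y = x + h + log(1 + exp(n - x - h)), computed stably with logaddexp.
    """
    return x + h + np.logaddexp(0.0, n - x - h)

# Hypothetical toy setup: 4 log-spectral bins, 2 eigenvoices.
mean_voice = np.array([2.0, 1.0, 0.5, 0.0])
eigenvoices = np.array([[1.0, 0.0],
                        [0.0, 1.0],
                        [0.5, 0.5],
                        [0.0, 0.0]])
weights = np.array([0.2, -0.1])   # eigenvoice coefficients for one speaker
noise_mean = np.full(4, -1.0)     # additive noise mean (log-spectral)
channel = np.zeros(4)             # convolutional channel offset

# Speaker-dependent clean mean replaces the speaker-independent one
# as the expansion point of the mismatch function.
mu_x = eigenvoice_mean(mean_voice, eigenvoices, weights)
mu_y = vts_mismatch(mu_x, noise_mean, channel)
print(mu_x)
print(mu_y)
```

In a first-order VTS expansion, evaluating the mismatch function at the expansion point gives the compensated mean. The only change sketched here relative to the speaker-independent case is that the clean-speech mean passed in depends on the eigenvoice coefficients.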
