【24h】

Pronunciation Lexicon Adaptation for TTS Voice Building

机译:用于TTS语音构建的语音词典适应

获取原文
获取原文并翻译 | 示例

摘要

This paper describes reducing phone label errors in TTS voice building by means of modeling of speaker pronunciation variants. Each speaker has his or her own unique pronunciations (and context-dependent variations), so that no one standard lexicon is able to cover all of the speaker's variations. Creating speaker-dependent pronunciation lexicons for automatic speech labeling of our TTS voice databases helped to eliminate many pronunciation errors that resulted from mismatches between lexical pronunciations and how the speaker (voice talent) actually pronounced a word. We also found that it contributed other synthesis quality improvement as well. A perceptual test showed that our work contributed to MOS improvement for American English male and female voices.
机译:本文介绍了通过对发音者语音变体进行建模来减少TTS语音构建中的电话标签错误。每个说话者都有自己独特的发音(以及与上下文相关的变体),因此没有一个标准的词典能够涵盖所有说话者的变体。为我们的TTS语音数据库创建自动语音标记的依赖于说话者的发音词典,这有助于消除许多发音错误,这些错误是由于词汇发音之间的不匹配以及发音者(语音才能)实际发音一个单词而导致的。我们还发现它也有助于其他合成质量的提高。知觉测试表明,我们的工作有助于改善美国英语男女声音的MOS。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号