首页> 外文会议>International Workshop on Machine Learning for Multimodal Interaction >A Mixed-Lingual Phonological Component Which Drives the Statistical Prosody Control of a Polyglot TTS Synthesis System
【24h】

A Mixed-Lingual Phonological Component Which Drives the Statistical Prosody Control of a Polyglot TTS Synthesis System

机译:一种混合语音组件,驱动统计韵律控制的多胶TTS合成系统

获取原文

摘要

A polyglot text-to-speech synthesis system which is able to read aloud mixed-lingual text has first of all to derive the correct pronunciation. This is achieved with an accurate morpho-syntactic analyzer that works simultaneously as language detector, followed by a phonological component which performs various phonological transformations. The result of these symbol processing steps is a complete phonological description of the speech to be synthesized. The subsequent processing step, i.e. prosody control, has to generate numerical values for the physical prosodic parameters from this description, a task that is very different from the former ones. This article shows appropriate solutions to both types of tasks, namely a particular rule-based approach for the phonological component and a statistical or machine learning approach to prosody control.
机译:能够朗读混合语言文本的多胶文本到语音合成系统首先才能导出正确的发音。这是通过同时用作语言检测器的准确的杂语分析仪来实现的,然后是进行各种语音​​变换的语音组件。这些符号处理步骤的结果是要合成的语音的完整语音描述。随后的处理步骤,即韵律控制,必须从该描述生成物理韵律参数的数值,这是与前者的任务非常不同的任务。本文为两种类型的任务显示了适当的解决方案,即语音组件的特定规则的方法以及韵律控制的统计或机器学习方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号