Conference: Cross-modal analysis of speech, gestures, gaze and facial expressions

Articulatory Speech Re-synthesis: Profiting from Natural Acoustic Speech Data

Abstract

The quality of static phones (e.g. vowels, fricatives, nasals, laterals) generated by articulatory speech synthesizers has reached a high level in recent years. Our goal is to extend this quality to dynamic speech, i.e. whole syllables, words, and utterances, by re-synthesizing natural acoustic speech data. Re-synthesis means that vocal tract action units or articulatory gestures, which describe the succession of speech movements, are adapted spatio-temporally to a natural speech signal produced by a "model speaker" of Standard German. This adaptation is performed using the software tool SAGA (Sound and Articulatory Gesture Alignment), which is currently under development in our lab. The resulting action unit scores are stored in a database and serve as input for our articulatory speech synthesizer. This technique is intended to serve as the basis for unit-selection articulatory speech synthesis in the future.
