Articulatory Speech Re-synthesis: Profiting from Natural Acoustic Speech Data

Abstract

The quality of static phones (e.g. vowels, fricatives, nasals, laterals) generated by articulatory speech synthesizers has reached a high level in recent years. Our goal is to extend this quality to dynamic speech, i.e. whole syllables, words, and utterances, by re-synthesizing natural acoustic speech data. Re-synthesis means that vocal tract action units, or articulatory gestures, which describe the succession of speech movements, are adapted spatio-temporally to a natural speech signal produced by a "model speaker" of Standard German. This adaptation is performed using the software tool SAGA (Sound and Articulatory Gesture Alignment), which is currently under development in our lab. The resulting action unit scores are stored in a database and serve as input for our articulatory speech synthesizer. This technique is designed to be the basis for unit-selection articulatory speech synthesis in the future.

Bibliographic details

Source
Cross-modal analysis of speech, gestures, gaze and facial expressions | 2008 | pp. 344-355 | 12 pages
Conference location: Prague (CZ)
Authors
Dominik Bauer; Jim Kannampuzha; Bernd J. Kroeger
Author affiliation (all three authors)
Department of Phoniatrics, Pedaudiology, and Communication Disorders, University Hospital Aachen and RWTH Aachen University, Aachen, Germany
Original format: PDF
Language: English
Chinese Library Classification: Artificial intelligence theory
Keywords
speech; articulatory speech synthesis; articulation; re-synthesis; vocal tract action units

Similar literature

1. Estimation of articulatory movements from speech acoustics using an HMM-based speech production model [J]. Hiroya S., Honda M. IEEE Transactions on Speech and Audio Processing. 2004, no. 2
2. The contrast between alveolar and velar stops with typical speech data: acoustic and articulatory analyses [J]. Roberta Michelon Melo, Larissa Cristina Berti, Helena Bolli Mota. CoDAS. 2017, no. 3
3. Articulatory Speech Re-synthesis: Profiting from Natural Acoustic Speech Data [C]. Dominik Bauer, Jim Kannampuzha, Bernd J. Kroeger. COST Action 2102 International Conference on Cross-Modal Analysis of Speech, Gestures, Gaze and Facial Expressions. 2009
4. Acoustic indicators of articulatory organization in babbling and early speech [D]. Earnest, Margaret Moffitt. 2005
5. A study of acoustic-to-articulatory inversion of speech by analysis-by-synthesis using chain matrices and the Maeda articulatory model [O]. Sankaran Panchapagesan, Abeer Alwan
6. Mixture density networks, human articulatory data and acoustic-to-articulatory inversion of continuous speech [O]. Richmond Korin. 2001
7. Stochastic Articulatory-to-Acoustic Mapping as a Basis for Speech Recognition [R]. Hogden, J. E., Valdez, P. F. 2000
