首页> 外文会议>International Speech Communication Association >Personalizing synthetic voices for people with progressive speech disorders:judging voice similarity
【24h】

Personalizing synthetic voices for people with progressive speech disorders:judging voice similarity

机译:为具有渐进性语音障碍的人的个性化合成声音:判断语音相似性

获取原文

摘要

In building personalized synthetic voices for people with speech disorders, the output should capture the individual's vocal identity.This paper reports a listener judgment experiment on the similarity of Hidden Markov Model based synthetic voices using varying amounts of adaptation data to two non-impaired speakers. We conclude that around 100 sentences of data is needed to build a voice that retains the characteristics of the target speaker but using more data improves the voice. Experiments using Multi-Layer Perceptrons (MLPs) are conducted to find which acoustic features contribute to the similarity judgments.Results show that mel-cepstral distortion and fraction of voicing agreement contribute most to replicating the similarity judgment but the combination of all features is required for accurate prediction. Ongoing work applies the findings to voice building for people with impaired speech.
机译:在为具有言语障碍的人的人的个性化合成声音中,产出应捕获个人的声音标识。本文使用不同数量的适应数据到两个非受损的扬声器,报告了倾听的Markov模型的相似性对隐马尔可夫模型的相似性。我们得出结论,需要大约100个数据句来构建保留目标扬声器特性但使用更多数据来提高语音。进行使用多层感知(MLPS)的实验以找到有助于相似性判断的声学特征。结果表明,敏感术失真和发声协议的分数最为贡献,以复制相似度判断,但需要所有功能的组合准确的预测。正在进行的工作适用于致辞受损的人的声音建设的调查结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号