首页> 外文会议>Workshop on speech and language processing for assistive technologies >Towards Personalized Synthesized Voices for Individuals with Vocal Disabilities: Voice Banking and Reconstruction
【24h】

Towards Personalized Synthesized Voices for Individuals with Vocal Disabilities: Voice Banking and Reconstruction

机译:对具有声音残疾个人的个性化合成声音:语音银行和重建

获取原文

摘要

When individuals lose the ability to produce their own speech, due to degenerative diseases such as motor neurone disease (MND) or Parkinson's, they lose not only a functional means of communication but also a display of their individual and group identity. In order to build personalized synthetic voices, attempts have been made to capture the voice before it is lost, using a process known as voice banking. But, for some patients, the speech deterioration frequently coincides or quickly follows diagnosis. Using HMM-based speech synthesis, it is now possible to build personalized synthetic voices with minimal data recordings and even disordered speech. The power of this approach is that it is possible to use the patient's recordings to adapt existing voice models pre-trained on many speakers. When the speech has begun to deteriorate, the adapted voice model can be further modified in order to compensate for the disordered characteristics found in the patient's speech. The University of Edinburgh has initiated a project for voice banking and reconstruction based on this speech synthesis technology. At the current stage of the project, more than fifteen patients with MND have already been recorded and five of them have been delivered a reconstructed voice. In this paper, we present an overview of the project as well as subjective assessments of the reconstructed voices and feedback from patients and their families.
机译:当个人失去产生自己的演讲的能力时,由于诸如运动神经元疾病(MND)或帕金森等退行性疾病,它们不仅失去了常规通信手段,而且还失去了他们的个人和群体身份的展示。为了建立个性化的合成声音,已经尝试在丢失之前捕获声音,使用称为语音银行的过程。但是,对于一些患者来说,语音恶化经常恰好或快速遵循诊断。使用基于肝的语音合成,现在可以使用最小的数据记录和甚至混乱的语音构建个性化的合成声音。这种方法的力量是可以使用患者的录音来调整在许多扬声器上预先培训的现有语音模型。当语音已经开始劣化时,可以进一步修改适应的语音模型,以便补偿患者语音中发现的无序特性。爱丁堡大学已启动基于此语音合成技术的语音银行和重建项目。在该项目的当前阶段,已经记录了超过十五名MND患者,其中五名患者已经交付了重建的声音。在本文中,我们概述了该项目的概述以及重建声音的主观评估以及患者及其家庭的反馈。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号