Towards Personalized Synthesized Voices for Individuals with Vocal Disabilities: Voice Banking and Reconstruction

Abstract

When individuals lose the ability to produce their own speech because of degenerative diseases such as motor neurone disease (MND) or Parkinson's disease, they lose not only a functional means of communication but also a display of their individual and group identity. To build personalized synthetic voices, attempts have been made to capture the voice before it is lost, using a process known as voice banking. For some patients, however, speech deterioration frequently coincides with or quickly follows diagnosis. Using HMM-based speech synthesis, it is now possible to build personalized synthetic voices from minimal recordings, and even from disordered speech. The power of this approach is that the patient's recordings can be used to adapt existing voice models pre-trained on many speakers. When the speech has already begun to deteriorate, the adapted voice model can be further modified to compensate for the disordered characteristics found in the patient's speech. The University of Edinburgh has initiated a project for voice banking and reconstruction based on this speech synthesis technology. At the current stage of the project, more than fifteen patients with MND have been recorded and five of them have received a reconstructed voice. In this paper, we present an overview of the project as well as subjective assessments of the reconstructed voices and feedback from patients and their families.
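
The two steps the abstract describes (adapting a multi-speaker model to the patient, then modifying the adapted model to compensate for disordered characteristics) can be illustrated with a deliberately simplified sketch. It is not the authors' method: it assumes a single Gaussian mean per feature vector, uses a textbook MAP mean update as a stand-in for whatever adaptation the project actually uses, and models "compensation" as pulling selected dimensions back toward the average voice. The names `adapt_mean`, `compensate`, `tau`, and `weights` are hypothetical.

```python
from typing import List, Sequence


def adapt_mean(avg_mean: Sequence[float],
               patient_frames: Sequence[Sequence[float]],
               tau: float = 10.0) -> List[float]:
    """MAP-style update of one Gaussian mean toward the patient's data.

    avg_mean is the mean from the multi-speaker (average voice) model and
    tau (an assumed prior weight) controls how strongly that prior is
    trusted when only a few patient frames are available.
    """
    n = len(patient_frames)
    dim = len(avg_mean)
    sums = [sum(frame[d] for frame in patient_frames) for d in range(dim)]
    return [(tau * avg_mean[d] + sums[d]) / (tau + n) for d in range(dim)]


def compensate(adapted_mean: Sequence[float],
               avg_mean: Sequence[float],
               weights: Sequence[float]) -> List[float]:
    """Interpolate selected dimensions back toward the average voice.

    weights[d] = 0 keeps the patient's adapted value; weights[d] = 1
    replaces it with the average-voice value. Which dimensions count as
    affected by the disorder is an assumption supplied by the caller.
    """
    return [(1.0 - w) * a + w * m
            for a, m, w in zip(adapted_mean, avg_mean, weights)]


if __name__ == "__main__":
    average = [0.0, 0.0]
    frames = [[2.0, 4.0], [2.0, 4.0]]  # toy patient feature frames
    adapted = adapt_mean(average, frames, tau=2.0)
    # Treat the second dimension as disordered and reset it to the average.
    repaired = compensate(adapted, average, weights=[0.0, 1.0])
    print(adapted, repaired)
```

With few frames the adapted mean stays close to the average voice, which is the property that makes adaptation workable from minimal recordings; the per-dimension weights are one simple way to express keeping speaker identity where the speech is intact while borrowing from the average voice where it is not.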


Bibliographic details

Source: Workshop on speech and language processing for assistive technologies | 2013 | 5 pages
Authors: Christophe Veaux; Junichi Yamagishi; Simon King
Original format: PDF
Chinese Library Classification: Programming, software engineering
Keywords: HTS; Speech Synthesis; Voice Banking; Voice Reconstruction; Voice Output Communication Aids; MND


Similar literature


1. Revisiting personal voice and social voice in written discourse [J]. 法特梅•巴格里, 邓鹂鸣. 中国应用语言学：英文版 (Chinese Journal of Applied Linguistics, English edition). 2019, No. 3
2. Speech synthesis technologies for individuals with vocal disabilities: Voice banking and reconstruction [J]. Junichi Yamagishi, Christophe Veaux, Simon King. Acoustical Science and Technology. 2012, No. 1
3. The Perception of Vocal Traits in Synthesized Voices: Age, Gender, and Human Likeness [J]. Alice Baird, Stina Hasse Jorgensen, Emilia Parada-Cabaleiro. Journal of the Audio Engineering Society. 2018, No. 4
4. Acoustic analysis of hypernasality by perception of synthesized voices based on spectral modification and vocal tract model [J]. Yukie Kozaki, Hidemi Yoshimasu, Teruo Amagasa. 電子情報通信学会技術研究報告. 音声 (IEICE Technical Report, Speech). 2003, No. 154
5. Towards Personalized Synthesized Voices for Individuals with Vocal Disabilities: Voice Banking and Reconstruction [C]. Christophe Veaux, Junichi Yamagishi, Simon King. Workshop on speech and language processing for assistive technologies. 2013
6. The Development and Use of a Modified Text Messaging Application Utilizing Voice Output and Pictures/Picture Symbols to Increase Instances of Independent Electronic Communication for Individuals with Moderate to Severe Intellectual and Developmental Disabilities [D]. David A. Lojkovic. 2015
7. Voice Onset Time in Individuals With Hyperfunctional Voice Disorders: Evidence for Disordered Vocal Motor Control [O]. Victoria S. McKenna, Jennifer A. Hylkema, Monique C. Tardif
8. Speech synthesis technologies for individuals with vocal disabilities: Voice banking and reconstruction [O]. Junichi Yamagishi, Christophe Veaux, Simon King. 2012
