首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing;ICASSP >Acoustic data-driven grapheme-to-phoneme conversion using KL-HMM
【24h】

Acoustic data-driven grapheme-to-phoneme conversion using KL-HMM

机译:使用KL-HMM进行声数据驱动的音素到音素转换

获取原文

摘要

This paper proposes a novel grapheme-to-phoneme (G2P) conversion approach where first the probabilistic relation between graphemes and phonemes is captured from acoustic data using Kullback-Leibler divergence based hidden Markov model (KL-HMM) system. Then, through a simple decoding framework the information in this probabilistic relation is integrated with the sequence information in the orthographic transcription of the word to infer the phoneme sequence. One of the main application of the proposed G2P approach is in the area of low linguistic resource based automatic speech recognition or text-to-speech systems. We demonstrate this potential through a simulation study where linguistic resources from one domain is used to create linguistic resources for a different domain.
机译:本文提出了一种新颖的音素到音素(G2P)转换方法,该方法首先使用基于Kullback-Leibler散度的隐马尔可夫模型(KL-HMM)系统从声学数据中捕获音素与音素之间的概率关系。然后,通过简单的解码框架,将这种概率关系中的信息与单词的正字法转录中的序列信息集成在一起,以推断音素序列。所提出的G2P方法的主要应用之一是在基于低语言资源的自动语音识别或文本转语音系统领域。我们通过仿真研究证明了这种潜力,在仿真研究中,来自一个域的语言资源被用来创建用于另一个域的语言资源。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号