首页> 外文期刊>IEICE Transactions on Information and Systems >Speaker Adaptation Method for Acoustic-to-Articulatory Inversion using an HMM-Based Speech Production Model
【24h】

Speaker Adaptation Method for Acoustic-to-Articulatory Inversion using an HMM-Based Speech Production Model

机译:基于基于HMM的语音产生模型的语音到发音反转的说话人自适应方法

获取原文
获取原文并翻译 | 示例
           

摘要

We present a speaker adaptation method that makes it possible to determine articulately parameters from an unknown speaker's speech spectrum using an HMM (Hidden Markov Model)-based speech production model. The model consists of HMMs of articulatory parameters for each phoneme and an articulatory-to-acoustic mapping that transforms the articulatory parameters into a speech spectrum for each HMM state. The model is statistically constructed by using actual articulatory-acoustic data. In the adaptation method, geometrical differences in the vocal tract as well as the articulatory behavior in the reference model are statistically adjusted to an unknown speaker. First, the articulatory parameters are estimated from an unknown speaker's speech spectrum using the reference model. Secondly, the articulatory-to-acoustic mapping is adjusted by maximizing the output probability of the acoustic parameters for the estimated articulatory parameters of the unknown speaker. With the adaptation method, the RMS error between the estimated articulatory parameters and the observed ones is 1.65 mm. The improvement rate over the speaker independent model is 56.1 %.
机译:我们提出了一种说话人自适应方法,该方法可以使用基于HMM(隐马尔可夫模型)的语音产生模型从未知说话人的语音频谱中明确地确定参数。该模型由每个音素的发音参数的HMM和发音到声音的映射组成,该映射将每个HMM状态的发音参数转换为语音频谱。该模型是通过使用实际的关节声学数据进行统计构造的。在适应方法中,参考模型中声道的几何差异以及发音行为在统计学上被调整为未知的说话者。首先,使用参考模型从未知讲话者的语音频谱中估计发音参数。其次,通过最大化未知扬声器的估计发音参数的声学参数的输出概率来调整发音到声学的映射。通过自适应方法,估计的关节参数与所观察到的关节参数之间的RMS误差为1.65 mm。与独立扬声器模型相比,改进率为56.1%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号