首页> 外文期刊>International Journal of Information Acquisition >TEXT-INDEPENDENT VOICE CONVERSION BASED ON CHINESE PHONEME CLASSIFICATION AND KERNEL EIGENVOICES GAUSSIAN MIXTURE MODEL
【24h】

TEXT-INDEPENDENT VOICE CONVERSION BASED ON CHINESE PHONEME CLASSIFICATION AND KERNEL EIGENVOICES GAUSSIAN MIXTURE MODEL

机译:基于汉语语音分类和核本征语音高斯混合模型的文本无关语音转换

获取原文
获取原文并翻译 | 示例
       

摘要

This paper proposed a novel algorithm for text-independent voice conversion based on Chinese phoneme classification and kernel eigenvoices Gaussian mixture model. The phoneme classification can avoid the disturbance of linguistic information and spectral smoothing. A speaker adaptation technique of kernel eigenvoices was employed for performing spectral conversion between speakers for each category phoneme, adapting the conversion parameters derived for the pre-stored pairs of speakers to a desired pair, which can relax the parallel constraint effectively. Objective test on the spectral conversion accuracy demonstrated that the proposed kernel algorithm can effectively exploit the nonlin-earity in supervector space. In subjective listening test, an ABX test was performed and the proposed algorithm was preferred to the existing eigenvoice algorithm by 4.75%, and improved quality by 10.91% in terms of mean opinion score (MOS). Both objective and subjective tests demonstrated that the proposed algorithm effectively enhanced speech quality and speaker individuality in a text-independent manner.
机译:提出了一种基于中文音素分类和核本征高斯混合模型的文本无关语音转换新算法。音素分类可以避免语言信息和频谱平滑的干扰。对于每个类别音素,采用内核特征语音的说话人自适应技术在说话人之间执行频谱转换,将针对预存的说话人对导出的转换参数调整为所需的对,从而可以有效地缓解并行约束。对频谱转换精度的客观测试表明,所提出的核算法可以有效利用超向量空间中的非线性。在主观听觉测试中,进行了ABX测试,所提出的算法比现有的本征语音算法要低4.75%,在平均意见得分(MOS)方面,质量要提高10.91%。客观测试和主观测试均表明,该算法以独立于文本的方式有效地提高了语音质量和说话人个性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号