TEXT-INDEPENDENT VOICE CONVERSION BASED ON CHINESE PHONEME CLASSIFICATION AND KERNEL EIGENVOICES GAUSSIAN MIXTURE MODEL

YANPING LI; LINGHUA ZHANG; HUI DING


Abstract

This paper proposes a novel algorithm for text-independent voice conversion based on Chinese phoneme classification and a kernel eigenvoices Gaussian mixture model. Phoneme classification can avoid the disturbance caused by linguistic information and by spectral smoothing. A kernel eigenvoices speaker adaptation technique performs spectral conversion between speakers for each phoneme category, adapting the conversion parameters derived from pre-stored speaker pairs to a desired pair, which effectively relaxes the constraint of parallel training data. An objective test of spectral conversion accuracy demonstrated that the proposed kernel algorithm can effectively exploit the nonlinearity in supervector space. In the subjective listening evaluation, an ABX test showed that the proposed algorithm was preferred to the existing eigenvoice algorithm by 4.75%, and it improved quality by 10.91% in terms of mean opinion score (MOS). Both objective and subjective tests demonstrated that the proposed algorithm effectively enhances speech quality and speaker individuality in a text-independent manner.
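
The idea of exploiting nonlinearity in supervector space can be illustrated with generic kernel PCA. The sketch below is not the authors' adaptation procedure: it assumes each pre-stored speaker is summarized by a supervector (here random vectors standing in for stacked GMM mean vectors), uses an RBF kernel chosen purely for illustration, and the names rbf_kernel, fit_kernel_pca, project, and the value of gamma are all hypothetical.

```python
import numpy as np

def rbf_kernel(A, B, gamma):
    """RBF kernel between the rows of A and the rows of B."""
    sq = (A ** 2).sum(1)[:, None] + (B ** 2).sum(1)[None, :] - 2 * A @ B.T
    return np.exp(-gamma * np.maximum(sq, 0.0))

def fit_kernel_pca(X, gamma, n_components):
    """Kernel PCA over speaker supervectors (one row of X per speaker)."""
    n = X.shape[0]
    K = rbf_kernel(X, X, gamma)
    one = np.full((n, n), 1.0 / n)
    # Centre the kernel matrix in feature space.
    Kc = K - one @ K - K @ one + one @ K @ one
    eigvals, eigvecs = np.linalg.eigh(Kc)
    idx = np.argsort(eigvals)[::-1][:n_components]
    lam = eigvals[idx]
    # Scale coefficients so each feature-space axis has unit norm.
    alpha = eigvecs[:, idx] / np.sqrt(lam)
    return {"X": X, "K": K, "gamma": gamma, "alpha": alpha,
            "lam": lam, "Z": Kc @ alpha}

def project(model, x_new):
    """Coordinates of a new supervector on the kernel principal axes."""
    K = model["K"]
    k = rbf_kernel(x_new[None, :], model["X"], model["gamma"])  # shape (1, n)
    # Centre the new kernel row consistently with the training kernel.
    kc = k - k.mean() - K.mean(axis=0)[None, :] + K.mean()
    return (kc @ model["alpha"]).ravel()

# Toy data: 8 hypothetical pre-stored speakers, 12-dimensional supervectors.
rng = np.random.default_rng(0)
X = rng.standard_normal((8, 12))
model = fit_kernel_pca(X, gamma=0.05, n_components=3)
weights = project(model, X[0])  # low-dimensional weights for one speaker
```

In an eigenvoice-style system, a small weight vector of this kind would be what gets estimated for a new speaker pair instead of a full set of conversion parameters; how the paper estimates and maps those weights back to GMM parameters is not stated in the abstract.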


Bibliographic record

Source
International Journal of Information Acquisition | 2011, Issue 4 | pp. 303-314 | 12 pages
Authors
YANPING LI; LINGHUA ZHANG; HUI DING
Affiliations

College of Telecommunications & Information Engineering, Nanjing University of Posts and Telecommunications, Nanjing, Jiangsu, P. R. China

College of Telecommunications & Information Engineering, Nanjing University of Posts and Telecommunications, Nanjing, Jiangsu, P. R. China

Jiaxing University, Jiaxing, P. R. China

Original format: PDF
Language: English
Keywords
voice conversion; phoneme classification; kernel eigenvoices; text-independent

