首页> 外文会议>International Conference on Intelligent Systems and Knowledge Engineering >High quality voice conversion based on ISODATA clustering algorithm
【24h】

High quality voice conversion based on ISODATA clustering algorithm

机译:基于ISODATA聚类算法的高质量语音转换

获取原文

摘要

Two main challenges introduced in current voice conversion are the dependence on parallel training data and the trade-off between speaker similarity and speech quality. To tackle the latter problem, this paper proposes a novel conversion method based on Iterative Self-organizing DATA Analysis Techniques Algorithm (ISODATA) clustering algorithm. Specially, we use ISODATA during the training of Gaussian mixture model, the optimized mixture number can guarantee the validity and accuracy of the GMM model, which can acquire speaker's identity effectively related to speaker similarity between original target speech and converted speech, Next, we combine improved GMM and bilinear frequency warping for the conversion stage, which can get a good balance between speaker similarity and speech quality. Theory analysis and experimental results demonstrate that the proposed algorithm can achieve higher quality and similarity compared with other two methods.
机译:当前语音转换中引入的两个主要挑战是对并行训练数据的依赖性以及说话者相似度和语音质量之间的权衡。为了解决后一个问题,本文提出了一种基于迭代自组织数据分析技术算法(ISODATA)聚类算法的转换方法。特别地,我们在训练高斯混合模型时使用ISODATA,优化的混合数可以保证GMM模型的有效性和准确性,可以有效地获得与原始目标语音和转换后语音的说话人相似性相关的说话人身份,接下来,我们结合在转换阶段改进了GMM和双线性频率扭曲,可以在说话者相似度和语音质量之间取得良好的平衡。理论分析和实验结果表明,与其他两种方法相比,该算法具有更高的质量和相似度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号