【24h】

GPU-Friendly Local Regression for Voice Conversion

机译:GPU友好的本地回归以进行语音转换

获取原文

摘要

Voice conversion is the task of transforming a source speaker's voice so that it sounds like a target speaker's voice. We present a GPU-friendly local regression model for voice conversion that is capable of converting speech in real-time and achieves state-of-the-art accuracy on this task. Our model uses a new approximation for computing local regression coefficients that is explicitly designed to preserve memory locality. As a result, our inference procedure is amenable to efficient implementation on the GPU. Our approach is more than 10X faster than a highly optimized CPU-based implementation, and is able to convert speech 2.7X faster than real-time.
机译:语音转换是转换源说话者语音以使其听起来像目标说话者语音的任务。我们为语音转换提供了GPU友好的本地回归模型,该模型能够实时转换语音并在此任务上达到最先进的准确性。我们的模型使用一种新的近似值来计算局部回归系数,该近似值已明确设计为保留内存局部性。结果,我们的推理过程适合在GPU上高效实现。我们的方法比高度优化的基于CPU的实现快10倍以上,并且能够将语音转换速度比实时速度快2.7倍。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号