GPU-Friendly Local Regression for Voice Conversion

机译：GPU友好的本地回归以进行语音转换

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Voice conversion is the task of transforming a source speaker's voice so that it sounds like a target speaker's voice. We present a GPU-friendly local regression model for voice conversion that is capable of converting speech in real-time and achieves state-of-the-art accuracy on this task. Our model uses a new approximation for computing local regression coefficients that is explicitly designed to preserve memory locality. As a result, our inference procedure is amenable to efficient implementation on the GPU. Our approach is more than 10X faster than a highly optimized CPU-based implementation, and is able to convert speech 2.7X faster than real-time.

机译：语音转换是转换源说话者语音以使其听起来像目标说话者语音的任务。我们为语音转换提供了GPU友好的本地回归模型，该模型能够实时转换语音并在此任务上达到最先进的准确性。我们的模型使用一种新的近似值来计算局部回归系数，该近似值已明确设计为保留内存局部性。结果，我们的推理过程适合在GPU上高效实现。我们的方法比高度优化的基于CPU的实现快10倍以上，并且能够将语音转换速度比实时速度快2.7倍。

著录项

来源
《Conference on the North American Chapter of the Association for Computational Linguistics: Human Language Technologies》|2015年|1334-1338|共5页
会议地点
作者
Taylor Berg-Kirkpatrick; Dan Klein;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Voice conversion using General Regression Neural Network [J] . Jagannath Nirmal, Mukesh Zaveri, Suprava Patnaik, Applied Soft Computing . 2014,第Null期

机译：使用通用回归神经网络进行语音转换
2. Spectral Mapping Using Kernel Principal Components Regression for Voice Conversion [J] . Peng SONGW, Li ZHAO, Yongqiang BAO Archives of acoustics . 2013,第1期

机译：使用内核主成分回归进行语音转换的频谱映射
3. Voice Conversion Using Dynamic Kernel Partial Least Squares Regression [J] . Helander E., Silen H., Virtanen T., Audio, Speech, and Language Processing, IEEE Transactions on . 2012,第3期

机译：使用动态核偏最小二乘回归进行语音转换
4. GPU-Friendly Local Regression for Voice Conversion [C] . Taylor Berg-Kirkpatrick, Dan Klein Conference on the North American Chapter of the Association for Computational Linguistics: Human Language Technologies . 2015

机译：GPU友好的语音转换当地回归
5. On nonparametric estimation and inference with censored data, bandwidth selection for local polynomial regression, and subset selection in explanatory regression analyses [D] . Peterson, Derick Randall. 1998

机译：关于非参数估计和删失数据推断，局部多项式回归的带宽选择以及解释性回归分析中的子集选择
6. The return of the Iberian lynx to Portugal: local voices [O] . Margarida Lopes-Fernandes, Clara Espírito-Santo, Amélia Frazão-Moreira 2018

机译：伊比利亚返回葡萄牙：当地的声音
7. Regression Approaches to Voice Quality Controll Based on One-to-Many Eigenvoice Conversion [O] . Kumi Ohta, Yamato Ohtani, Tomoki Toda, 2007

机译：一对多特征语音转换的语音质量控制回归方法

GPU-Friendly Local Regression for Voice Conversion

摘要

著录项

相似文献

相关主题

期刊订阅