IEEE Workshop on Applications of Signal Processing to Audio and Acoustics

SPEECH-TO-SINGING SYNTHESIS: CONVERTING SPEAKING VOICES TO SINGING VOICES BY CONTROLLING ACOUSTIC FEATURES UNIQUE TO SINGING VOICES

Abstract

This paper describes a speech-to-singing synthesis system that synthesizes a singing voice from a speaking voice reading the lyrics of a song and from the song's musical score. The system is based on the speech manipulation system STRAIGHT and comprises three models, each controlling one of three acoustic features unique to singing voices: the fundamental frequency (F0), phoneme duration, and spectrum. Given the musical score and its tempo, the F0 control model generates the F0 contour of the singing voice by controlling four types of F0 fluctuation: overshoot, vibrato, preparation, and fine fluctuation. The duration control model lengthens each phoneme in the speaking voice according to the duration of its musical note. The spectral control model converts the spectral envelope of the speaking voice into that of a singing voice by controlling both the singing formant and the amplitude modulation of formants in synchronization with vibrato. Experimental results show that the proposed system can convert speaking voices into singing voices whose naturalness is almost the same as that of actual singing voices.
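To make the F0 control idea more concrete, the following is a minimal sketch of how a stepwise melody taken from a score might be turned into a singing-like F0 contour with overshoot and vibrato. It is not the paper's model: the abstract gives no equations, so the underdamped second-order smoother, the sinusoidal vibrato, and every name and parameter value here (`frame_rate`, `omega`, `zeta`, `rate_hz`, `depth_cents`) are assumptions chosen for illustration. Preparation and fine fluctuation are left out.

```python
import math

def midi_to_hz(note):
    """Convert a MIDI note number to Hz (A4 = note 69 = 440 Hz)."""
    return 440.0 * 2.0 ** ((note - 69) / 12.0)

def step_contour(notes, frame_rate=200):
    """Stepwise pitch target in semitones from (midi_note, seconds) pairs."""
    contour = []
    for note, dur in notes:
        contour.extend([float(note)] * int(round(dur * frame_rate)))
    return contour

def smooth_with_overshoot(contour, frame_rate=200, omega=40.0, zeta=0.6):
    """Track the target with an underdamped second-order system.

    Assumption: zeta < 1 makes the output overshoot the target after
    each note change; omega and zeta are illustrative values only.
    """
    dt = 1.0 / frame_rate
    y, v = contour[0], 0.0
    out = []
    for target in contour:
        accel = omega * omega * (target - y) - 2.0 * zeta * omega * v
        v += accel * dt   # semi-implicit Euler: update velocity first
        y += v * dt       # then position, using the new velocity
        out.append(y)
    return out

def add_vibrato(contour, frame_rate=200, rate_hz=5.5, depth_cents=50.0):
    """Add sinusoidal vibrato; depth is the peak deviation in cents."""
    return [
        p + (depth_cents / 100.0)
        * math.sin(2.0 * math.pi * rate_hz * i / frame_rate)
        for i, p in enumerate(contour)
    ]

# Two half-second notes: C4 (60) then E4 (64).
target = step_contour([(60, 0.5), (64, 0.5)])
smoothed = smooth_with_overshoot(target)
f0_hz = [midi_to_hz(p) for p in add_vibrato(smoothed)]
```

The contour is processed in semitones so that overshoot and vibrato depth are pitch-relative, and it is converted to Hz only at the end. A resynthesis stage would then impose `f0_hz` on the analyzed speech frame by frame.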