首页> 外国专利> VOICE SEPARATION DEVICE, VOICE SEPARATION METHOD, VOICE SEPARATION PROGRAM, AND VOICE SEPARATION SYSTEM

VOICE SEPARATION DEVICE, VOICE SEPARATION METHOD, VOICE SEPARATION PROGRAM, AND VOICE SEPARATION SYSTEM

机译:语音分离装置,语音分离方法,语音分离程序和语音分离系统

摘要

A voice separation device (12) of this voice separation system is provided with: a feature amount extraction unit (121) that extracts time-series data of a voice feature amount of mixed voices; a block separation unit (122) that separates the voice feature amount time-series data into blocks having a certain time width; a voice separation neural network (1b) that creates, from the voice feature amount time-series data blocks, time-series data of a mask for each of a plurality of speakers; and a voice restoration unit (123) that restores the voice data of each of the plurality of speakers from the mask time-series data and voice feature amount time-series data of the mixed voices. In the creation of the mask time-series data for each of the plurality of speakers, the voice separation neural network (1b) uses time-series data of a block which temporally precedes the present time, in a forward-direction LSTM neural network, and uses time-series data of a block constructed by a predetermined number of frames which temporally succeed the present time, in a reverse-direction LSTM neural network.
机译:该声音分离系统的声音分离装置(12)包括:特征量提取单元(121),其提取混合语音的语音特征量的时间序列数据;以及特征量提取单元(121)。块分离单元(122),将语音特征量时间序列数据分离为具有一定时间宽度的块;语音分离神经网络(1b),其从语音特征量时间序列数据块中为多个扬声器中的每一个创建掩模的时间序列数据;语音恢复单元(123),从混合语音的掩码时间序列数据和语音特征量时间序列数据恢复多个扬声器中的每个扬声器的语音数据。在为多个扬声器中的每个扬声器创建掩码时间序列数据时,语音分离神经网络(1b)在正向LSTM神经网络中使用时间在当前时间之前的块的时间序列数据,并在反向LSTM神经网络中使用由预定数量的帧构成的块的时序数据,这些帧暂时在当前时间后继。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号