...
首页> 外文期刊>The Journal of the Acoustical Society of America >Impact of phase estimation on single-channel speech separation based on time-frequency masking
【24h】

Impact of phase estimation on single-channel speech separation based on time-frequency masking

机译:相位估计对基于时频掩蔽的单通道语音分离的影响

获取原文
获取原文并翻译 | 示例
           

摘要

Time-frequency masking is a common solution for the single-channel source separation (SCSS) problem where the goal is to find a time-frequency mask that separates the underlying sources from an observed mixture. An estimated mask is then applied to the mixed signal to extract the desired signal. During signal reconstruction, the time-frequency-masked spectral amplitude is combined with the mixture phase. This article considers the impact of replacing the mixture spectral phase with an estimated clean spectral phase combined with the estimated magnitude spectrum using a conventional model-based approach. As the proposed phase estimator requires estimated fundamental frequency of the underlying signal from the mixture, a robust pitch estimator is proposed. The upper-bound clean phase results show the potential of phase-aware processing in single-channel source separation. Also, the experiments demonstrate that replacing the mixture phase with the estimated clean spectral phase consistently improves perceptual speech quality, predicted speech intelligibility, and source separation performance across all signal-to-noise ratio and noise scenarios. (C) 2017 Acoustical Society of America.
机译:时频屏蔽是用于单通道源分离(SCSS)问题的通用解决方案,其中目标是找到将底层源与观察到的混合物分开的时频掩模。然后将估计的掩模应用于混合信号以提取所需信号。在信号重建期间,时间频率屏蔽光谱幅度与混合相结合。本文考虑使用常规模型的方法与估计的清洁光谱相结合估计的清洁光谱相结合估计的幅度谱的影响。由于所提出的相位估计器需要来自混合物的底层信号的估计频率,提出了一种坚固的音高估计器。上限的清洁阶段结果显示了单通道源分离中相位感知处理的可能性。此外,实验表明,用估计的清洁谱相替换混合相位始终如一地改善了所有信噪比和噪声场景的感知语音质量,预测的语音可懂度和源分离性能。 (c)2017年声学社会。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号