首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing;ICASSP >Two-microphone source separation algorithm based on statistical modeling of angle distributions
【24h】

Two-microphone source separation algorithm based on statistical modeling of angle distributions

机译:基于角度分布统计建模的两麦克风源分离算法

获取原文

摘要

In this paper we present a novel two-microphone sound source separation algorithm, which selects speech from the target speaker while suppressing signals from interfering sources. In this algorithm, which is refered to as SMAD-CW, we first estimate the direction of sound sources for each time-frequency bin using phase differences in the spectral domain. For each frame we assume that the angle distribution is a mixture of two distributions, one from the target and the other from the dominant noise source. For each mixture component we use the von Mises distribution, which is a close approximation to the wrapped normal distribution. The expectation-maximization (EM) algorithm is employed to obtain parameters of this mixture distribution. Using this statistical model, we perform maximum a posteriori (MAP) hypothesis testing in order to obtain appropriate binary masks. We demonstrate that the algorithm described in this paper provides speech recognition accuracy that is significantly better than that obtained using conventional approaches.
机译:在本文中,我们提出了一种新颖的两麦克风声源分离算法,该算法从目标扬声器中选择语音,同时抑制来自干扰源的信号。在该算法(称为SMAD-CW)中,我们首先使用频谱域中的相位差来估计每个时频仓的声源方向。对于每一帧,我们假定角度分布是两种分布的混合,一种来自目标,另一种来自主要噪声源。对于每个混合成分,我们使用冯·米塞斯分布,该分布非常接近包裹的正态分布。期望最大化(EM)算法用于获得此混合物分布的参数。使用此统计模型,我们执行最大后验(MAP)假设检验,以获得适当的二进制掩码。我们证明了本文描述的算法提供的语音识别准确度明显优于使用常规方法获得的语音识别准确度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号