首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >Real-time integration of statistical model-based speech enhancement with unsupervised noise PSD estimation using microphone array
【24h】

Real-time integration of statistical model-based speech enhancement with unsupervised noise PSD estimation using microphone array

机译:使用麦克风阵列将基于统计模型的语音增强与无监督噪声PSD估计进行实时集成

获取原文

摘要

We propose a technique of multi-channel speech enhancement based on integration of beamforming and statistical model-based speech enhancement to clearly extract the target speech, even in very noisy environments. Conventional microphone array-based techniques estimate speech and noise power spectral densities (PSDs) from the spatial cues of the sound sources; however, their estimation errors dramatically increase when there are many noise sources. We integrated clean speech models trained in advance and the noise PSDs estimated in beamspace to compose observation models and designed a precise Wiener filter. Experiments under adverse noise conditions showed that the proposed technique significantly improved the signal-to-noise ratios (SNRs) compared with the conventional microphone array processing technique.
机译:我们提出了一种基于波束形成和基于统计模型的语音增强集成的多通道语音增强技术,即使在嘈杂的环境中也可以清晰地提取目标语音。基于传统麦克风阵列的技术会根据声源的空间提示来估计语音和噪声功率谱密度(PSD)。但是,当有许多噪声源时,它们的估计误差会急剧增加。我们集成了预先训练的干净语音模型和波束空间中估计的噪声PSD,以构成观察模型,并设计了精确的Wiener滤波器。在不利的噪声条件下进行的实验表明,与传统的麦克风阵列处理技术相比,所提出的技术显着提高了信噪比(SNR)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号