首页> 外文会议>2012 Third International Conference on Intelligent Control and Information Processing. >An improved BLUES with adaptive threshold of condition number for separating underdetermined speech mixtures
【24h】

An improved BLUES with adaptive threshold of condition number for separating underdetermined speech mixtures

机译:带有条件数阈值自适应阈值的改进BLUES用于分离不确定语音混合物

获取原文
获取原文并翻译 | 示例

摘要

Speech separation has been studied for decades, to which one challenge is the underdetermined problem, where there are more sources than microphones. To solve this problem, Pedersen et al. proposed recently an effective algorithm called BLUES (BLind Underdetermined Extraction of Sources) by combining ICA and time-frequency masking, and it works well on instantaneous/convolutive mixtures of both speech and music. One key ingredient to BLUES is the stopping criterion of the separation process, where the condition number of the outputs is compared with a fixed threshold in the original version. However, as audio recordings are always varying in speech sources and their number, using a fixed threshold would not fit in with these changes, and then deteriorate the overall performance. As such, we propose a threshold update strategy to improve BLUES by adapting the threshold with an increasing rate to find the most suitable condition number. A new criterion based on detection of the number of the sources is then presented to stop the algorithm. The experiments are carried out by using the synthetic and real recorded underdetermined mixtures. The results show that our approach obtains improved performance compared to the original BLUES when the number of the speeches included in the underdetermined mixtures is increased.
机译:语音分离已经研究了数十年,面临的挑战之一是不确定性问题,其中的来源比麦克风更多。为了解决这个问题,Pedersen等人。最近,通过将ICA和时频掩蔽相结合,提出了一种有效的算法,称为BLUES(盲源提取),它在语音和音乐的瞬时/卷积混合中效果很好。 BLUES的一个关键因素是分离过程的停止标准,其中将输出的条件编号与原始版本中的固定阈值进行比较。但是,由于音频记录在语音来源及其数量方面总是变化的,因此使用固定的阈值将无法适应这些变化,从而使总体性能下降。因此,我们提出了一种阈值更新策略,通过以不断增加的速率调整阈值来找到最合适的条件编号,从而改善了BLUES。然后提出了基于对源数目的检测的新准则以停止算法。通过使用合成的和实际记录的不确定混合物进行实验。结果表明,当不确定混音中包含的语音数量增加时,与原始BLUES相比,我们的方法可获得更好的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号