An improved BLUES with adaptive threshold of condition number for separating underdetermined speech mixtures

机译：带有条件数阈值自适应阈值的改进BLUES用于分离不确定语音混合物

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Speech separation has been studied for decades, to which one challenge is the underdetermined problem, where there are more sources than microphones. To solve this problem, Pedersen et al. proposed recently an effective algorithm called BLUES (BLind Underdetermined Extraction of Sources) by combining ICA and time-frequency masking, and it works well on instantaneous/convolutive mixtures of both speech and music. One key ingredient to BLUES is the stopping criterion of the separation process, where the condition number of the outputs is compared with a fixed threshold in the original version. However, as audio recordings are always varying in speech sources and their number, using a fixed threshold would not fit in with these changes, and then deteriorate the overall performance. As such, we propose a threshold update strategy to improve BLUES by adapting the threshold with an increasing rate to find the most suitable condition number. A new criterion based on detection of the number of the sources is then presented to stop the algorithm. The experiments are carried out by using the synthetic and real recorded underdetermined mixtures. The results show that our approach obtains improved performance compared to the original BLUES when the number of the speeches included in the underdetermined mixtures is increased.

机译：语音分离已经研究了数十年，面临的挑战之一是不确定性问题，其中的来源比麦克风更多。为了解决这个问题，Pedersen等人。最近，通过将ICA和时频掩蔽相结合，提出了一种有效的算法，称为BLUES（盲源提取），它在语音和音乐的瞬时/卷积混合中效果很好。 BLUES的一个关键因素是分离过程的停止标准，其中将输出的条件编号与原始版本中的固定阈值进行比较。但是，由于音频记录在语音来源及其数量方面总是变化的，因此使用固定的阈值将无法适应这些变化，从而使总体性能下降。因此，我们提出了一种阈值更新策略，通过以不断增加的速率调整阈值来找到最合适的条件编号，从而改善了BLUES。然后提出了基于对源数目的检测的新准则以停止算法。通过使用合成的和实际记录的不确定混合物进行实验。结果表明，当不确定混音中包含的语音数量增加时，与原始BLUES相比，我们的方法可获得更好的性能。

著录项

来源
《2012 Third International Conference on Intelligent Control and Information Processing.》|2012年|p.694- 698|共5页
会议地点 Dalian(CN);Dalian(CN)
作者
Guo Tiying; Lin Qiuhua; Gong Xiaofeng;
展开▼
作者单位

School of Information and Communication Engineering, Dalian University of Technology, Dalian 116024, China;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类人工智能理论;人工智能理论;
关键词

相似文献

外文文献
中文文献
专利

1. Blind separation of underdetermined Convolutive speech mixtures by time-frequency masking with the reduction of musical noise of separated signals [J] . Zohrevandi Mahbanou, Setayeshi Saeed, Rabiee Azam, Multimedia Tools and Applications . 2021,第8期

机译：通过时频掩模在减少分离信号的音乐噪声的时频掩模盲分离
2. Blind Techniques for Improving Speech Mixtures Using an Adaptive Method [J] . Jasmine J.C. Sheeja, B. Sankara Gomathi Research Journal of Applied Sciences: RJAS . 2014,第5期

机译：使用自适应方法改善语音混合的盲技术
3. Improved density peak clustering-based adaptive Gaussian mixture model for damage monitoring in aircraft structures under time-varying conditions [J] . Qiu Lei, Fang Fang, Yuan Shenfang Mechanical systems and signal processing . 2019,第JULa1期

机译：改进的基于密度峰值聚类的自适应高斯混合模型，用于时变条件下飞机结构的损伤监测
4. An improved BLUES with adaptive threshold of condition number for separating underdetermined speech mixtures [C] . Guo Tiying, Lin Qiuhua, Gong Xiaofeng International Conference on Intelligent Control and Information Processing . 2012

机译：具有用于分离有未确定的语音混合物的条件号的自适应阈值的改进的蓝色
5. Improving Enrichment Strategies in Outcome-Dependent Sampling Designs and Adaptive Biomarker-Threshold Designs [D] . Wang, Ting. 2020

机译：改善依赖于不同的采样设计和自适应生物标志物阈值设计中的浓缩策略
6. Extended high-frequency bandwidth improves reception of speech in spatially separated masking speech [O] . Suzanne Carr Levy, Daniel J. Freed, Michael Nilsson, -1

机译：扩展的高频带宽改善了在空间上分离的掩蔽语音中的语音接收
7. Separating underdetermined convolutive speech mixtures [O] . Michael Syskind Pedersen, Deliang Wang, Ulrik Kjems 2008

机译：分离不确定的卷积语音混合物

An improved BLUES with adaptive threshold of condition number for separating underdetermined speech mixtures

摘要

著录项

相似文献

相关主题

期刊订阅