A reverberation-time-aware DNN approach leveraging spatial information for microphone array dereverberation

Bo Wu; Minglei Yang; Kehuang Li; Zhen Huang; Sabato Marco Siniscalchi; Tong Wang; Chin-Hui Lee

Abstract

A reverberation-time-aware, deep-neural-network (DNN)-based multi-channel speech dereverberation framework is proposed to handle a wide range of reverberation times (RT60s). Designing a robust system involves three key steps. First, to perform speech dereverberation and beamforming simultaneously, we propose a framework, DNNSpatial, that selectively concatenates log-power spectral (LPS) input features of reverberant speech from multiple microphones in an array and maps them to the expected output LPS features of anechoic reference speech using a single DNN. Next, the temporal auto-correlation function of the received signals at different RT60s is investigated to show that RT60-dependent temporal-spatial contexts are needed for feature selection in the DNNSpatial training stage to optimize system performance in diverse reverberant environments. Finally, the RT60 is estimated in order to select the proper temporal and spatial contexts before the LPS features are fed to the trained DNNs for speech dereverberation. The experimental evidence gathered in this study indicates that the proposed framework outperforms both weighted prediction error (WPE), a state-of-the-art signal processing dereverberation algorithm, and conventional DNNSpatial systems that do not take the reverberation time into account, even under extremely weak and severe reverberant conditions. The proposed technique generalizes well to unseen room sizes, array geometries and loudspeaker positions, and is robust to reverberation time estimation errors.
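
The feature pipeline described in the abstract (per-channel LPS frames, an RT60-dependent choice of temporal and spatial context, and concatenation into a single DNN input vector) can be illustrated with a minimal Python sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the function names, the naive DFT, the edge-frame padding, and above all the CONTEXT_TABLE values, whose numbers and even the direction of the frames-versus-channels trade-off are placeholders.

```python
import cmath
import math


def log_power_spectrum(frame):
    """Log-power spectrum of one real-valued frame via a naive DFT.

    Returns the non-negative frequency bins only (len(frame) // 2 + 1 values).
    """
    n = len(frame)
    lps = []
    for k in range(n // 2 + 1):
        coeff = sum(x * cmath.exp(-2j * math.pi * k * t / n)
                    for t, x in enumerate(frame))
        lps.append(math.log(abs(coeff) ** 2 + 1e-12))  # small floor avoids log(0)
    return lps


# Hypothetical lookup table (NOT from the paper):
# (upper RT60 bound in seconds, context frames on each side, channels used)
CONTEXT_TABLE = [
    (0.3, 1, 8),
    (0.6, 3, 4),
    (float("inf"), 5, 2),
]


def select_context(rt60):
    """Pick (half_context, n_channels) for an estimated RT60 in seconds."""
    for upper, half_ctx, n_ch in CONTEXT_TABLE:
        if rt60 <= upper:
            return half_ctx, n_ch


def build_input(lps_by_channel, t, half_ctx, n_ch):
    """Concatenate LPS frames t-half_ctx..t+half_ctx of the first n_ch channels.

    lps_by_channel[c][f] is the LPS vector of frame f on channel c.
    Frames beyond the utterance edges are replaced by the nearest edge frame.
    """
    n_frames = len(lps_by_channel[0])
    vec = []
    for channel in lps_by_channel[:n_ch]:
        for offset in range(-half_ctx, half_ctx + 1):
            idx = min(max(t + offset, 0), n_frames - 1)
            vec.extend(channel[idx])
    return vec


# Toy data: 4 channels, 6 frames, 3 frequency bins per frame.
toy_lps = [[[float(c * 100 + f)] * 3 for f in range(6)] for c in range(4)]
half_ctx, n_ch = select_context(0.45)          # estimated RT60 of 0.45 s
x = build_input(toy_lps, t=2, half_ctx=half_ctx, n_ch=n_ch)
print(len(x))                                   # channels * (2*half_ctx+1) * bins
```

In this sketch the estimated RT60 only changes how many frames and channels are stacked into the input vector; the DNN that maps that vector to anechoic LPS features is outside its scope.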

Bibliographic record

Source
EURASIP Journal on Advances in Signal Processing, 2017, Issue 1
Authors
Bo Wu; Minglei Yang; Kehuang Li; Zhen Huang; Sabato Marco Siniscalchi; Tong Wang; Chin-Hui Lee
Original format
PDF
Chinese Library Classification
Communications
Keywords
Deep neural networks (DNNs); Simultaneous speech dereverberation and beamforming; Auto-correlation function; Temporal and spatial contexts; Reverberation-time-aware (RTA)

Similar literature

1. A Reverberation-Time-Aware Approach to Speech Dereverberation Based on Deep Neural Networks [J]. Bo Wu, Kehuang Li, Minglei Yang. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2017, Issue 1.
2. Dereverberation and denoising based on generalized spectral subtraction by multi-channel LMS algorithm using a small-scale microphone array [J]. Longbiao Wang, Kyohei Odani, Atsuhiko Kai. EURASIP Journal on Advances in Signal Processing, 2012, Issue 1.
3. Analysis of noise reduction and dereverberation techniques based on microphone arrays with postfiltering [J]. C. Marro, Y. Mahieux. IEEE Transactions on Speech and Audio Processing, 1998, Issue 3.
4. A New Approach to Dereverberation and Noise Reduction with Microphone Arrays [C]. J.L. Sanchez-Bote, J. Gonzalez-Rodriguez, J. Ortega-Garcia. Signal Processing X: Theories and Applications, 2000.
5. Blind adaptive dereverberation of speech signals using a microphone array [D]. Tariq Saad Bakir. 2004.
6. Selective-Tap Blind Dereverberation for Two-Microphone Enhancement of Reverberant Speech [O]. Kostas Kokkinakis, Philipos C. Loizou.
7. DNN-based mask estimation for distributed speech enhancement in spatially unconstrained microphone arrays [O]. Nicolas Furnon, Romain Serizel, Slim Essid. 2021.
