...
首页> 外文期刊>Audio, Speech, and Language Processing, IEEE/ACM Transactions on >Associative Memory Model-Based Linear Filtering and Its Application to Tandem Connectionist Blind Source Separation
【24h】

Associative Memory Model-Based Linear Filtering and Its Application to Tandem Connectionist Blind Source Separation

机译:基于关联记忆模型的线性滤波及其在串联连接盲源分离中的应用

获取原文
获取原文并翻译 | 示例
           

摘要

We propose a blind source separation method that yields high-quality speech with low distortion. Time-frequency (TF) masking can effectively reduce interference, but it produces nonlinear distortion. By contrast, linear filtering using a separation matrix such as independent vector analysis (IVA) can avoid nonlinear distortion, but the separation performance is reduced under reverberant conditions. The tandem connectionist approach combines several separation methods and it has been used frequently to compensate for the disadvantages of these methods. In this study, we propose associative memory model (AMM)-based linear filtering and a tandem connectionist framework, which applies TF masking followed by linear filtering. By using AMM trained with speech spectra to optimize the separation matrix, the proposed linear filtering method considers the properties of speech that are not considered explicitly in IVA, such as the harmonic components of spectra. TF masking is applied in the proposed tandem connectionist framework to reduce unwanted components that hinder the optimization of the separation matrix, and it is approximated by using a linear separation matrix to reduce nonlinear distortion. The results obtained in simultaneous speech separation experiments demonstrate that although the proposed linear filtering method can increase the signal-to-distortion ratio (SDR) and signal-to-interference ratio (SIR) compared with IVA, the proposed tandem connectionist framework can obtain greater increases in SDR and SIR, and it reduces the phoneme error rate more than the proposed linear filtering method.
机译:我们提出了一种盲源分离方法,该方法可以产生具有低失真的高质量语音。时频(TF)屏蔽可以有效减少干扰,但是会产生非线性失真。相比之下,使用分离矩阵(例如独立向量分析(IVA))进行线性滤波可以避免非线性失真,但是在混响条件下分离性能会降低。串联连接方法结合了几种分离方法,并且已被频繁使用以弥补这些方法的缺点。在这项研究中,我们提出了基于联想记忆模型(AMM)的线性过滤和串联连接框架,该框架将TF屏蔽应用于线性过滤。通过使用经过语音频谱训练的AMM优化分离矩阵,提出的线性滤波方法考虑了IVA中未明确考虑的语音属性,例如频谱的谐波分量。在提出的串联连接器框架中应用了TF屏蔽,以减少妨碍分离矩阵优化的有害成分,并通过使用线性分离矩阵来减少非线性失真来对其进行近似。在同时语音分离实验中获得的结果表明,尽管所提出的线性滤波方法与IVA相比可以提高信号失真比(SDR)和信号干扰比(SIR),但是所提出的串联连接主义框架可以获得更大的优势。 SDR和SIR的增加,比拟议的线性滤波方法更能降低音素错误率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号