Associative Memory Model-Based Linear Filtering and Its Application to Tandem Connectionist Blind Source Separation

Motoi Omachi; Tetsuji Ogawa; Tetsunori Kobayashi

首页> 外文期刊>Audio, Speech, and Language Processing, IEEE/ACM Transactions on >Associative Memory Model-Based Linear Filtering and Its Application to Tandem Connectionist Blind Source Separation

【24h】

Associative Memory Model-Based Linear Filtering and Its Application to Tandem Connectionist Blind Source Separation

机译：基于关联记忆模型的线性滤波及其在串联连接盲源分离中的应用

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

We propose a blind source separation method that yields high-quality speech with low distortion. Time-frequency (TF) masking can effectively reduce interference, but it produces nonlinear distortion. By contrast, linear filtering using a separation matrix such as independent vector analysis (IVA) can avoid nonlinear distortion, but the separation performance is reduced under reverberant conditions. The tandem connectionist approach combines several separation methods and it has been used frequently to compensate for the disadvantages of these methods. In this study, we propose associative memory model (AMM)-based linear filtering and a tandem connectionist framework, which applies TF masking followed by linear filtering. By using AMM trained with speech spectra to optimize the separation matrix, the proposed linear filtering method considers the properties of speech that are not considered explicitly in IVA, such as the harmonic components of spectra. TF masking is applied in the proposed tandem connectionist framework to reduce unwanted components that hinder the optimization of the separation matrix, and it is approximated by using a linear separation matrix to reduce nonlinear distortion. The results obtained in simultaneous speech separation experiments demonstrate that although the proposed linear filtering method can increase the signal-to-distortion ratio (SDR) and signal-to-interference ratio (SIR) compared with IVA, the proposed tandem connectionist framework can obtain greater increases in SDR and SIR, and it reduces the phoneme error rate more than the proposed linear filtering method.

机译：我们提出了一种盲源分离方法，该方法可以产生具有低失真的高质量语音。时频（TF）屏蔽可以有效减少干扰，但是会产生非线性失真。相比之下，使用分离矩阵（例如独立向量分析（IVA））进行线性滤波可以避免非线性失真，但是在混响条件下分离性能会降低。串联连接方法结合了几种分离方法，并且已被频繁使用以弥补这些方法的缺点。在这项研究中，我们提出了基于联想记忆模型（AMM）的线性过滤和串联连接框架，该框架将TF屏蔽应用于线性过滤。通过使用经过语音频谱训练的AMM优化分离矩阵，提出的线性滤波方法考虑了IVA中未明确考虑的语音属性，例如频谱的谐波分量。在提出的串联连接器框架中应用了TF屏蔽，以减少妨碍分离矩阵优化的有害成分，并通过使用线性分离矩阵来减少非线性失真来对其进行近似。在同时语音分离实验中获得的结果表明，尽管所提出的线性滤波方法与IVA相比可以提高信号失真比（SDR）和信号干扰比（SIR），但是所提出的串联连接主义框架可以获得更大的优势。 SDR和SIR的增加，比拟议的线性滤波方法更能降低音素错误率。

著录项

来源
《Audio, Speech, and Language Processing, IEEE/ACM Transactions on》 |2017年第3期|637-650|共14页
作者
Motoi Omachi; Tetsuji Ogawa; Tetsunori Kobayashi;
展开▼
作者单位

Department of Computer Science, Waseda University, Tokyo, Japan;

Department of Computer Science, Waseda University, Tokyo, Japan;

Department of Computer Science, Waseda University, Tokyo, Japan;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Speech; Nonlinear distortion; Interference; Source separation; Speech recognition; Music; Manganese;

机译：语音;非线性失真;干扰;源分离;语音识别;音乐;锰;

相似文献

外文文献
中文文献
专利

1. Optimization and Estimation of Complex-Valued Signals: Theory and applications in filtering and blind source separation [J] . Adali T., Schreier P. Signal Processing Magazine, IEEE . 2014,第5期

机译：复值信号的优化和估计：滤波和盲源分离的理论和应用
2. Blind Compensation of Nonlinear Distortions: Application to Source Separation of Post-Nonlinear Mixtures [J] . Duarte L. T., Suyama R., Rivet B., Signal Processing, IEEE Transactions on . 2012,第11期

机译：非线性失真的盲补偿：在非线性后混合源分离中的应用
3. Design of oversampled generalised discrete fourier transform filter banks for application to subbandbased blind source separation [J] . Peng B., Liu W., Mandic D.P. Signal Processing, IET . 2013,第9期

机译：超采样广义离散傅里叶变换滤波器组的设计应用于基于子带的盲源分离
4. Separation matrix optimization using associative memory model for blind source separation [C] . Omachi Motoi, Ogawa Tetsuji, Kobayashi Tetsunori, European Signal Processing Conference . 2015

机译：使用关联记忆模型的分离矩阵优化用于盲源分离
5. Optimum nonlinearities and approximations for complex blind source separation. [D] . Zhang, Yang. 2012

机译：复杂的盲源分离的最佳非线性和近似值。
6. A Novel Underdetermined Blind Source Separation Method and Its Application to Source Contribution Quantitative Estimation [O] . Jiantao Lu, Wei Cheng, Yanyang Zi 2019

机译：一种不确定的盲源分离新方法及其在源贡献定量估计中的应用
7. A Study on Linear Blind Source Separation using Associative Memory Model [O] . 大町基 2017

机译：基于联想记忆模型的线性盲源分离研究
8. Virtues and Vices of Source Separation Using Linear Independent Component Analysis for Blind Source Separation of Non-Linearly Coupled and Synchronised Fetal and Mother ECGs. [R] . Sabry-Rizk, M., Zgallai, W., McLean, A., 2001

机译：利用线性独立分量分析非线性耦合和同步胎儿和母亲心电图的盲源分离源和分离。

Associative Memory Model-Based Linear Filtering and Its Application to Tandem Connectionist Blind Source Separation

摘要

著录项

相似文献

相关主题

期刊订阅