首页> 外文会议>INTERSPEECH 2012 >Boosting Classification Based Speech Separation Using Temporal Dynamics

【24h】

Boosting Classification Based Speech Separation Using Temporal Dynamics

机译：基于分类的语音分离使用时间动态提升

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Significant advances in speech separation have been made by formulating it as a classification problem, where the desired output is the ideal binary mask (IBM). Previous work does not explicitly model the correlation between neighboring time-frequency units and standard binary classifiers are used. As one of the most important characteristics of speech signal is its temporal dynamics, the IBM contains highly structured, instead of, random patterns. In this study, we incorporate temporal dynamics into classification by employing structured output learning. In particular, we use linear-chain structured perceptrons to account for the interactions of neighboring labels in time. However, the performance of structured perceptrons largely depends on the linear separability of features. To address this problem, we employ pre-trained deep neural networks to automatically learn effective feature functions for structured perceptrons. The experiments show that the proposed system significantly outperforms previous IBM estimation systems.

机译：通过将其作为分类问题将其制定为语音分离的显着进展，其中所需的输出是理想的二进制掩模（IBM）。以前的工作没有明确地模拟相邻的时频单元和标准二进制分类器之间的相关性。由于语音信号的最重要特征之一是其时间动态，IBM包含高度结构，而不是随机图案。在这项研究中，我们通过采用结构化输出学习将时间动态纳入分类。特别是，我们使用线性链结构的感觉分布来计算邻近标签及时的相互作用。然而，结构化的感知的性能在很大程度上取决于特征的线性可分离性。为了解决这个问题，我们采用预先训练的深度神经网络，自动学习结构化的感知的有效功能功能。实验表明，所提出的系统显着优于先前的IBM估计系统。

著录项

来源
《INTERSPEECH 2012》|2012年||共4页
会议地点
作者
Yuxuan Wang; DeLiang Wang;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 73.4136083;
关键词
Monaural speech separation; temporal dynamics; structured perceptron; deep neural networks;

机译：单声道言语分离;时间动态;结构化的感觉;深神经网络;

相似文献

外文文献
中文文献
专利

1. Integration of iconic gestures and speech in left superior temporal areas boosts speech comprehension under adverse listening conditions. [J] . Holle H, Obleser J, Rueschemeyer S NeuroImage . 2010,第1期

机译：在不利的聆听条件下，左上颞部区域的标志性手势和语音的整合可增强语音理解能力。
2. An approach to blind source separation based on temporal structure of speech signals [J] . Noboru Murata, Shiro Ikeda, Andreas Ziehe Neurocomputing . 2001,第期

机译：基于语音信号时间结构的盲源分离方法
3. Blind separation based on unitary transformation and complex Hermite moments: application to temporal and spatial mixture of speech [J] . N. Nakasako, H. Ogura, H. Kuruuchi 電子情報通信学会技術研究報告. 音声. Speech . 2000,第580期

机译：基于unit变和复杂厄米特矩的盲分离：在语音时空混合中的应用
4. Boosting Classification Based Speech Separation Using Temporal Dynamics [C] . Yuxuan Wang, DeLiang Wang Annual conference of the International Speech Communication Association . 2012

机译：使用时间动力学促进基于分类的语音分离
5. An AdaBoost Based Approach to Automatic Classification and Detection of Buildings Footprints, Vegetation Areas and Roads from Satellite Images [D] . Gonulalan, Cansu 2010

机译：基于AdaBoost的卫星图像自动识别和识别建筑物足迹，植被区域和道路的方法
6. Auditory cortical deactivation during speech production and following speech perception: an EEG investigation of the temporal dynamics of the auditory alpha rhythm [O] . David Jenson, Ashley W. Harkrider, David Thornton, 2015

机译：语音产生和语音感知后听觉皮层失活：听觉α节奏的时间动态的脑电图调查。
7. Improving air quality management using gradient boosting based hierarchical temporal memory neural networks and fuzzy based classification based regression tree [O] . Sagayaraj S, Vetrivelan N 2018

机译：基于梯度升压的分层时间内存神经网络和基于模糊的基于分类的回归树改善空气质量管理
8. Modeling Temporal Dynamics in the Classification of Auditory Signals [R] . Margoliash, D. 1993

机译：听觉信号分类中的时间动态建模

Boosting Classification Based Speech Separation Using Temporal Dynamics

摘要

著录项

相似文献

相关主题

期刊订阅