On Sparsity of Speech Features with Ladder Autoencoders for Multi-Speaker Separation



Abstract

The multi-speaker separation mechanism consists of speech feature extraction and temporal coherence. In this study, a speech feature extraction method is developed, and the quality of the reconstructed speech is evaluated at different degrees of sparsity. The feature extraction is implemented with ladder autoencoders whose branches embody a sparse encoder-decoder model; the autoencoders are trained on the WSJ0-2mix English corpus. The evaluation indicates that the reconstructed-speech quality is stable, with a signal-to-distortion ratio of >5 dB in the sparseness range of 0.4-0.7. The results suggest that the feature extraction method is applicable to the investigation of temporal coherence.
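The two quantities reported in the abstract, signal-to-distortion ratio (SDR) and sparseness, can be illustrated with a minimal sketch. The abstract does not say which SDR variant or which sparseness definition the authors used, so both functions below are assumptions: `sdr_db` is the plain energy-ratio SDR, and `hoyer_sparseness` is Hoyer's measure, chosen here only because it also lies between 0 and 1. The function names are hypothetical.

```python
import numpy as np


def sdr_db(reference, estimate):
    """Plain SDR in dB: 10 * log10(||s||^2 / ||s - s_hat||^2).

    Assumption: the paper may use a different variant (e.g. BSS Eval or SI-SDR).
    """
    reference = np.asarray(reference, dtype=float)
    estimate = np.asarray(estimate, dtype=float)
    distortion = reference - estimate
    return 10.0 * np.log10(np.sum(reference ** 2) / np.sum(distortion ** 2))


def hoyer_sparseness(x):
    """Hoyer sparseness in [0, 1] for a vector with at least two elements.

    Returns 0 when all entries have equal magnitude and 1 when exactly one
    entry is non-zero. Assumption: the paper's "sparseness" may be defined
    differently.
    """
    x = np.asarray(x, dtype=float).ravel()
    n = x.size
    l1 = np.sum(np.abs(x))
    l2 = np.sqrt(np.sum(x ** 2))
    return (np.sqrt(n) - l1 / l2) / (np.sqrt(n) - 1.0)
```

Under these assumed definitions, a reconstruction would meet the abstract's reported level when `sdr_db(clean, reconstructed) > 5.0` for feature vectors whose `hoyer_sparseness` falls between 0.4 and 0.7.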


Bibliographic details

Source
International Conference on Imaging, Signal Processing and Communication, 2020, pp. 39-43 (5 pages)
Authors
Hiroshi Sekiguchi; Yoshiaki Narusue; Hiroyuki Morikawa;
Original format: PDF
Keywords
Training; Computational modeling; Sociology; Coherence; Feature extraction; Brain modeling; Stability analysis;


Similar literature


1. Single Channel multi-speaker speech Separation based on quantized ratio mask and residual network [J]. Shanfa Ke, Ruimin Hu, Xiaochen Wang, Multimedia Tools and Applications. 2020, No. 43-44
2. Principles and Typical Computational Limitations of Sparse Speaker Separation Based on Deterministic Speech Features [J]. Albert Kern, Ruedi Stoop, Neural Computation. 2011, No. 9
3. Boosting sparsity-induced autoencoder: A novel sparse feature ensemble learning for image classification [J]. Rui Shi, Jian Ji, Chunhui Zhang, International Journal of Advanced Robotic Systems. 2019, No. 3
4. Robust distributed sparsity-constrained non-negative source separation and multi-speaker voice activity detection for speech enhancement in wireless acoustic sensor networks [C]. L. Khadidja Hamaidi, Michael Muma, Abdelhak M. Zoubir, International Conference on Signals and Systems. 2018
5. The Online Adjustment of Speaker-Specific Phonetic Beliefs in Multi-Speaker Speech Perception [D]. Lai, Wei. 2021
6. Defect-Repairable Latent Feature Extraction of Driving Behavior via a Deep Sparse Autoencoder [O]. Hailong Liu, Tadahiro Taniguchi, Kazuhito Takenaka, 2018
7. Sparse Autoencoder-Based Feature Transfer Learning for Speech Emotion Recognition [O]. Jun Deng, Zixing Zhang, Erik Marchi, 2013
