首页> 外文会议>International Conference on Imaging, Signal Processing and Communication >On Sparsity of Speech Features with Ladder Autoencoders for Multi-Speaker Separation
【24h】

On Sparsity of Speech Features with Ladder Autoencoders for Multi-Speaker Separation

机译:用梯子自动化器进行多扬声器分离的语音功能稀疏性

获取原文

摘要

The multi-speaker separation mechanism consists of speech feature extraction and temporal coherence. In this study, a speech feature extraction is developed, and the reconstructed-speech quality is evaluated with different degrees of sparsity. Speech feature extraction is implemented on ladder autoencoders with branches embodying a sparse encoder-decoder model where the autoencoders are trained with the WSJ0-2mix English Corpus. An evaluation indicates the stability of the reconstructed-speech quality, with a signal-to-distortion ratio of >5 dB in the sparseness range of 0.4-0.7. The results suggest the applicability of the feature extraction method to the investigation of temporal coherence.
机译:多扬声器分离机构包括语音特征提取和时间相干性。 在该研究中,开发了一种语音特征提取,并用不同程度的稀疏性评估重建语音质量。 语音特征提取在梯形AutoEncoders上实现,其中分支体现了一个稀疏的编码器 - 解码器模型,其中AutoEncoders培训了WSJ0-2Mix英语语料库。 评估表示重建语音质量的稳定性,其稀疏范围在0.4-0.7的稀疏范围内具有> 5 dB的信号 - 失真率。 结果表明特征提取方法适用于对时间相干性的研究。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号