Speech overlap detection and attribution using convolutive non-negative sparse coding

机译：使用卷积非负稀疏编码进行语音重叠检测和归因

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Overlapping speech is known to degrade speaker diarization performance with impacts on speaker clustering and segmentation. While previous work made important advances in detecting overlapping speech intervals and in attributing them to relevant speakers, the problem remains largely unsolved. This paper reports the first application of convolutive non-negative sparse coding (CNSC) to the overlap problem. CNSC aims to decompose a composite signal into its underlying contributory parts and is thus naturally suited to overlap detection and attribution. Experimental results on NIST RT data show that the CNSC approach gives comparable results to a state-of-the-art hidden Markov model based overlap detector. In a practical diarization system, CNSC based speaker attribution is shown to reduce the speaker error by over 40% relative in overlapping segments.

机译：众所周知，重叠语音会降低说话者的二分音表现，并影响说话者的聚类和分段。尽管先前的工作在检测重叠的语音间隔并将其归因于相关说话者方面取得了重要的进展，但问题仍未解决。本文报道了卷积非负稀疏编码（CNSC）在重叠问题上的首次应用。 CNSC的目的是将复合信号分解成其潜在的贡献部分，因此自然适用于重叠检测和归因。 NIST RT数据上的实验结果表明，CNSC方法与基于最新隐马尔可夫模型的重叠检测器可比。在实际的数字化系统中，基于CNSC的说话者归因显示可将说话者错误相对于重叠部分减少40％以上。

著录项

来源
《IEEE International Conference on Acoustics, Speech and Signal Processing;ICASSP》|2012年|p.4181- 4184|共4页
会议地点 Kyoto(JP)
作者
Vipperla, Ravichander;
展开▼
作者单位

Multimedia Communications Department Eurecom Sophia Antipolis France;

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Non-negative sparse autoencoder neural networks for the detection of overlapping, hierarchical communities in networked datasets [J] . Michele Rajtmajer S., Smith B., Phoha S. Chaos . 2012,第4期

机译：非负稀疏自动编码器神经网络，用于检测网络数据集中的重叠层次社区
2. Discovering Speech Phones Using Convolutive Non-negative Matrix Factorisation With A Sparseness Constraint [J] . Paul D. OGrady, Barak A. Pearlmutter Neurocomputing . 2008,第1a3期

机译：使用具有稀疏约束的卷积非负矩阵分解发现语音电话
3. Convolutional sparse coding with periodic overlapped group sparsity for rolling element bearing fault diagnosis [J] . Xia Yi, Lu Siliang Measurement Science & Technology . 2018,第11期

机译：卷积稀疏编码，具有周期性重叠的组稀疏性，用于滚动元件轴承故障诊断
4. Speech overlap detection and attribution using convolutive non-negative sparse coding [C] . Vipperla R., Geiger J.T., Bozonnet S., IEEE International Conference on Acoustics, Speech and Signal Processing . 2011

机译：使用卷曲非负稀疏编码进行语音重叠检测和归因
5. Modified Viterbi decoders for joint data detection and timing recovery of convolutionally encoded PPM and OPPM optical signals. [D] . Lai, Che-Hsi. 1994

机译：改进的Viterbi解码器，用于联合数据检测和卷积编码的PPM和OPPM光信号的定时恢复。
6. Learning an Efficient Hippocampal Place Map from Entorhinal Inputs Using Non-Negative Sparse Coding [O] . Yanbo Lian, Anthony N. Burkitt 2021

机译：使用非负稀疏编码学习来自Entorlinal输入的高效海马地图
7. Discovering speech phones using convolutive\ud non-negative matrix factorisation with a sparseness constraint [O] . O'Grady, Paul D., Pearlmutter, Barak A. 2008

机译：使用卷积\ ud查找语音电话具有稀疏约束的非负矩阵分解

Speech overlap detection and attribution using convolutive non-negative sparse coding

摘要

著录项

相似文献

相关主题

期刊订阅