首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing;ICASSP >Speaker attribution of multiple telephone conversations using a complete-linkage clustering approach
【24h】

Speaker attribution of multiple telephone conversations using a complete-linkage clustering approach

机译:使用完全链接聚类方法的多个电话对话的演讲者归因

获取原文

摘要

In this paper we propose and evaluate a speaker attribution system using a complete-linkage clustering method. Speaker attribution refers to the annotation of a collection of spoken audio based on speaker identities. This can be achieved using diarization and speaker linking. The main challenge associated with attribution is achieving computational efficiency when dealing with large audio archives. Traditional agglomerative clustering methods with model merging and retraining are not feasible for this purpose. This has motivated the use of linkage clustering methods without retraining. We first propose a diarization system using complete-linkage clustering and show that it outperforms traditional agglomerative and single-linkage clustering based diarization systems with a relative improvement of 40% and 68%, respectively. We then propose a complete-linkage speaker linking system to achieve attribution and demonstrate a 26% relative improvement in attribution error rate (AER) over the single-linkage speaker linking approach.
机译:在本文中,我们提出并评估了使用完全链接聚类方法的说话者归因系统。说话者归因是指基于说话者身份对语音音频集合的注释。这可以通过使用差异化和说话者链接来实现。与归因相关的主要挑战是在处理大型音频档案时要实现计算效率。为此,传统的带有模型合并和再训练的聚集聚类方法是不可行的。这激励了使用链接聚类方法而无需重新培训。我们首先提出一种使用完全链接聚类的隔离系统,并表明它优于传统的基于聚集和单链接聚类的隔离系统,相对改进分别为40%和68%。然后,我们提出了一种完整的链接说话者链接系统,以实现归因,并证明了归因错误率(AER)相对于单链接说话者链接方法有26%的相对改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号