Conference: Annual Conference of the International Speech Communication Association

Selection of TDOA Parameters for MDM Speaker Diarization

Abstract

Several methods to improve multiple distant microphone (MDM) speaker diarization based on Time Delay of Arrival (TDOA) features are evaluated in this paper. All of them avoid the use of a single reference channel to calculate the TDOA values and instead, based on different criteria, select from all possible pairs of microphones a set of pairs that will be used to estimate the TDOAs. The evaluated methods are named "Dynamic Margin" (DM), "Extreme Regions" (ER), "Most Common" (MC), "Cross Correlation" (XCorr) and "Principal Component Analysis" (PCA). All methods improve on the baseline results for the development set, and four of them also improve the results for the evaluation set. On the test set, DM and ER obtain relative diarization error rate (DER) improvements of 3.49% and 10.77% respectively, while XCorr and PCA achieve relative DER improvements of 36.72% and 30.82%. Moreover, the computational cost of the XCorr method is 20% lower than that of the baseline.
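
The abstract does not describe how each criterion scores microphone pairs, so the following is only an illustrative sketch of the general idea of choosing pairs without a fixed reference channel. It assumes, purely for illustration, that every pair is scored by the peak of its normalized cross-correlation and that the highest-scoring pairs are kept. The function names `xcorr_peak` and `select_pairs`, the `max_lag` search window, and the `n_pairs` cutoff are hypothetical and are not taken from the paper.

```python
from itertools import combinations
import math


def xcorr_peak(x, y, max_lag):
    """Return (lag, value) of the peak normalized cross-correlation.

    The search covers lags in [-max_lag, max_lag]. A positive lag means
    that y is a delayed copy of x, i.e. y[t + lag] lines up with x[t].
    """
    norm = math.sqrt(sum(a * a for a in x) * sum(b * b for b in y))
    if norm == 0.0:
        return 0, 0.0
    best_lag, best_val = 0, float("-inf")
    for lag in range(-max_lag, max_lag + 1):
        # Only sum over samples where both x[t] and y[t + lag] exist.
        start = max(0, -lag)
        stop = min(len(x), len(y) - lag)
        val = sum(x[t] * y[t + lag] for t in range(start, stop)) / norm
        if val > best_val:
            best_lag, best_val = lag, val
    return best_lag, best_val


def select_pairs(channels, max_lag, n_pairs):
    """Score every microphone pair and keep the n_pairs best ones.

    Assumption (not from the paper): a pair is "better" when its peak
    normalized cross-correlation is higher. Returns a list of
    (i, j, tdoa_in_samples) tuples, best pair first.
    """
    scored = []
    for i, j in combinations(range(len(channels)), 2):
        lag, peak = xcorr_peak(channels[i], channels[j], max_lag)
        scored.append((peak, i, j, lag))
    scored.sort(reverse=True)
    return [(i, j, lag) for _, i, j, lag in scored[:n_pairs]]
```

With M microphones there are M(M-1)/2 candidate pairs, so a selection step of this kind reduces the number of TDOA streams passed on to the diarization system. A practical system would typically use a more robust delay estimator and frame-level processing rather than this direct time-domain sum.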
