...
首页> 外文期刊>Audio, Speech, and Language Processing, IEEE/ACM Transactions on >Curriculum Learning Based Approaches for Noise Robust Speaker Recognition
【24h】

Curriculum Learning Based Approaches for Noise Robust Speaker Recognition

机译:基于课程学习的鲁棒说话人识别方法

获取原文
获取原文并翻译 | 示例
           

摘要

Performance of speaker identification (SID) systems is known to degrade rapidly in the presence of mismatch such as noise and channel degradations. This study introduces a novel class of curriculum learning (CL) based algorithms for noise robust speaker recognition. We introduce CL-based approaches at two stages within a state-of-the-art speaker verification system: at the i-Vector extractor estimation and at the probabilistic linear discriminant (PLDA) back-end. Our proposed CL-based approaches operate by categorizing the available training data into progressively more challenging subsets using a suitable difficulty criterion. Next, the corresponding training algorithms are initialized with a subset that is closest to a clean noise-free set, and progressively moving to subsets that are more challenging for training as the algorithms progress. We evaluate the performance of our proposed approaches on the noisy and severely degraded data from the DARPA RATS SID task, and show consistent and significant improvement across multiple test sets over a baseline SID framework with a standard i-Vector extractor and multisession PLDA-based back-end. We also construct a very challenging evaluation set by adding noise to the NIST SRE 2010 C5 extended condition trials, where our proposed CL-based PLDA is shown to offer significant improvements over a traditional PLDA based back-end.
机译:已知说话人识别(SID)系统的性能会在存在失配(例如噪声和声道降级)的情况下迅速降级。这项研究介绍了一种基于课程学习(CL)的新颖类算法,用于对噪声进行健壮的说话人识别。我们在最先进的说话者验证系统中的两个阶段引入基于CL的方法:在i-Vector提取器估计和概率线性判别(PLDA)后端。我们提出的基于CL的方法通过使用适当的难度标准将可用的训练数据分类为更具挑战性的子集来进行操作。接下来,使用最接近干净无噪声集合的子集初始化相应的训练算法,并随着算法的发展逐渐移至对训练更具挑战性的子集。我们评估了我们提出的方法对DARPA RATS SID任务中嘈杂和严重降级的数据的性能,并在带有标准i-Vector提取器和基于多会话PLDA的back SDA框架的基础上,对多个测试集显示了一致且显着的改进-结束。通过在NIST SRE 2010 C5扩展条件试验中增加噪声,我们还构建了一个极具挑战性的评估集,在该试验中,我们提出的基于CL的PLDA被证明比基于PLDA的传统后端具有显着改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号