Curriculum Learning Based Approaches for Noise Robust Speaker Recognition

Shivesh Ranjan; John H. L. Hansen

Abstract

The performance of speaker identification (SID) systems is known to degrade rapidly in the presence of mismatch such as noise and channel degradations. This study introduces a novel class of curriculum learning (CL) based algorithms for noise robust speaker recognition. We introduce CL-based approaches at two stages within a state-of-the-art speaker verification system: at the i-Vector extractor estimation and at the probabilistic linear discriminant analysis (PLDA) back-end. Our proposed CL-based approaches operate by categorizing the available training data into progressively more challenging subsets using a suitable difficulty criterion. The corresponding training algorithms are then initialized with the subset that is closest to a clean, noise-free set, and progressively move to subsets that are more challenging for training as the algorithms progress. We evaluate the performance of our proposed approaches on the noisy and severely degraded data from the DARPA RATS SID task, and show consistent and significant improvement across multiple test sets over a baseline SID framework with a standard i-Vector extractor and a multisession PLDA-based back-end. We also construct a very challenging evaluation set by adding noise to the NIST SRE 2010 C5 extended condition trials, on which our proposed CL-based PLDA is shown to offer significant improvements over a traditional PLDA-based back-end.
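The two steps described in the abstract (ranking training data by difficulty, then training from the easiest subset toward the hardest) can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes that difficulty is the negative signal-to-noise ratio (the abstract only says "a suitable difficulty criterion"), that earlier subsets stay in the training pool as harder ones are added, and that a caller-supplied `update` function stands in for one training pass of the i-Vector extractor or PLDA model. The names `split_by_difficulty`, `curriculum_train`, and `update` are hypothetical.

```python
from typing import Callable, List, Sequence, TypeVar

T = TypeVar("T")  # one training item, e.g. an utterance record
M = TypeVar("M")  # model state carried between training passes


def split_by_difficulty(
    items: Sequence[T], difficulty: Callable[[T], float], n_subsets: int
) -> List[List[T]]:
    """Sort items from easiest to hardest and cut them into n_subsets chunks."""
    ranked = sorted(items, key=difficulty)
    size, extra = divmod(len(ranked), n_subsets)
    subsets, start = [], 0
    for k in range(n_subsets):
        # The first `extra` chunks take one additional item each.
        end = start + size + (1 if k < extra else 0)
        subsets.append(ranked[start:end])
        start = end
    return subsets


def curriculum_train(
    model: M,
    subsets: Sequence[Sequence[T]],
    update: Callable[[M, List[T]], M],
    iters_per_stage: int = 1,
) -> M:
    """Train in stages, adding the next harder subset to the pool each stage."""
    pool: List[T] = []
    for subset in subsets:
        pool.extend(subset)  # assumption: easier data is kept, not discarded
        for _ in range(iters_per_stage):
            model = update(model, pool)
    return model


# Toy data: (utterance id, SNR in dB). Lower SNR is treated as harder.
utterances = [
    ("utt_a", 5.0), ("utt_b", 30.0), ("utt_c", 15.0),
    ("utt_d", 0.0), ("utt_e", 25.0), ("utt_f", 10.0),
]
subsets = split_by_difficulty(utterances, lambda u: -u[1], n_subsets=3)

# Stand-in for a real training pass: record the pool size seen at each stage.
history = curriculum_train([], subsets, update=lambda m, pool: m + [len(pool)])
```

In this toy run the first subset holds the two highest-SNR utterances and the pool grows by one subset per stage. A real system would replace `update` with an actual estimation step, and could equally train on each subset alone instead of on the growing pool; the abstract does not say which.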

Bibliographic details

Source: IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2018, no. 1, pp. 197-210 (14 pages)
Authors: Shivesh Ranjan; John H. L. Hansen
Original format: PDF
Language: English
Keywords: Training; Noise measurement; Noise robustness; Speech; Signal to noise ratio; Estimation; Rats

Similar literature

1. Curriculum learning based approach for noise robust language identification using DNN with attention [J]. Vuddagiri Ravi Kumar, Vydana Hari Krishna, Vuppala Anil Kumar. Expert Systems with Applications, 2018, issue NOVa.
2. Robust distant speaker recognition based on position-dependent CMN by combining speaker-specific GMM with speaker-adapted HMM [J]. Longbiao Wang, Norihide Kitaoka, Seiichi Nakagawa. Speech Communication, 2007, no. 6.
3. Robust Speaker Recognition in Noisy Conditions by Means of Online Training with Noise Profiles [J]. Ahmed H. Y. Al-Noori, Philip Duncan. Journal of the Audio Engineering Society, 2019, no. 4.
4. A curriculum learning method for improved noise robustness in automatic speech recognition [C]. Stefan Braun, Daniel Neil, Shih-Chii Liu. European Signal Processing Conference, 2017.
5. Robust speaker recognition in noise by coherent spectral modification [D]. Ramanujam, Vidhya. 2000.
6. Cost-Sensitive Learning for Emotion Robust Speaker Recognition [O]. Dongdong Li, Yingchun Yang, Weihui Dai.
7. A curriculum learning method for improved noise robustness in automatic speech recognition [O]. Stefan Braun, Daniel Neil, Shih-Chii Liu. 2017.
8. Noise-Robust System for NIST 2012 Speaker Recognition Evaluation [R]. Ferrer, L., McLaren, M., Scheffer, N. 2013.
