...
首页> 外文期刊>Neurocomputing >An unsupervised deep domain adaptation approach for robust speech recognition
【24h】

An unsupervised deep domain adaptation approach for robust speech recognition

机译:一种无监督的深度域自适应方法,可实现可靠的语音识别

获取原文
获取原文并翻译 | 示例
           

摘要

This paper addresses the robust speech recognition problem as a domain adaptation task. Specifically, we inttoduce an unsupervised deep domain adaptation (DDA) approach to acoustic modeling in order to eliminate the training-testing mismatch that is common in real-world use of speech recognition. Under a multi-task learning framework, the approach jointly learns two discriminative classifiers using one deep neural network (DNN). As the main task, a label predictor predicts phoneme labels and is used during training and at test time. As the second task, a domain classifier discriminates between the source and the target domains during training. The network is optimized by minimizing the loss of the label classifier and to maximize the loss of the domain classifier at the same time. The proposed approach is easy to implement by modifying a common feed-forward network. Moreover, this unsupervised approach only needs labeled training data from the source domain and some unlabeled raw data of the new domain. Speech recognition experiments on noise/channel distortion and domain shift confirm the effectiveness of the proposed approach. For instance, on the Aurora-4 corpus, compared with the acoustic model trained only using clean data, the DDA approach achieves relative 37.8% word error rate (WER) reduction. (C) 2017 Elsevier B.V. All rights reserved.
机译:本文将鲁棒的语音识别问题作为领域自适应任务来解决。具体来说,我们为声学建模引入了一种无监督的深域自适应(DDA)方法,以消除在现实世界中语音识别中常见的训练测试失配问题。在多任务学习框架下,该方法使用一个深度神经网络(DNN)联合学习两个判别分类器。标签预测器作为主要任务,可以预测音素标签,并在训练期间和测试时间使用。作为第二项任务,领域分类器在训练过程中区分源域和目标域。通过最小化标签分类器的损失并同时最大化域分类器的损失来优化网络。通过修改公共前馈网络,可以轻松实现所提出的方法。而且,这种无监督的方法仅需要来自源域的带标签的训练数据和新域的一些无标签的原始数据。针对噪声/通道失真和域移位的语音识别实验证实了该方法的有效性。例如,在Aurora-4语料库上,与仅使用干净数据训练的声学模型相比,DDA方法可实现37.8%的字错误率(WER)降低。 (C)2017 Elsevier B.V.保留所有权利。

著录项

  • 来源
    《Neurocomputing》 |2017年第27期|79-87|共9页
  • 作者单位

    Northwestern Polytech Univ, Sch Comp Sci, Shaanxi Prov Key Lab Speech & Image Informat Proc, Xian, Peoples R China;

    Northwestern Polytech Univ, Sch Comp Sci, Shaanxi Prov Key Lab Speech & Image Informat Proc, Xian, Peoples R China;

    Northwestern Polytech Univ, Sch Comp Sci, Shaanxi Prov Key Lab Speech & Image Informat Proc, Xian, Peoples R China;

    Northwestern Polytech Univ, Sch Comp Sci, Shaanxi Prov Key Lab Speech & Image Informat Proc, Xian, Peoples R China;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Domain adaptation; Robust speech recognition; Deep neural network; Deep learning;

    机译:领域自适应;鲁棒语音识别;深度神经网络;深度学习;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号