【24h】

Dual Learning for Machine Translation

机译:机器翻译的双重学习

获取原文

摘要

While neural machine translation (NMT) is making good progress in the past two years, tens of millions of bilingual sentence pairs are needed for its training. However, human labeling is very costly. To tackle this training data bottleneck, we develop a dual-learning mechanism, which can enable an NMT system to automatically learn from unlabeled data through a dual-learning game. This mechanism is inspired by the following observation: any machine translation task has a dual task, e.g., English-to-French translation (primal) versus French-to-English translation (dual); the primal and dual tasks can form a closed loop, and generate informative feedback signals to train the translation models, even if without the involvement of a human labeler. In the dual-learning mechanism, we use one agent to represent the model for the primal task and the other agent to represent the model for the dual task, then ask them to teach each other through a reinforcement learning process. Based on the feedback signals generated during this process (e.g., the language-model likelihood of the output of a model, and the reconstruction error of the original sentence after the primal and dual translations), we can iteratively update the two models until convergence (e.g., using the policy gradient methods). We call the corresponding approach to neural machine translation dual-NMT. Experiments show that dual-NMT works very well on English↔French translation; especially, by learning from monolingual data (with 10% bilingual data for warm start), it achieves a comparable accuracy to NMT trained from the full bilingual data for the French-to-English translation task.
机译:在过去两年中,神经机器翻译(NMT)取得了长足的进步,但其培训需要数以千万计的双语句子对。但是,人类标签非常昂贵。为了解决此培训数据瓶颈,我们开发了一种双重学习机制,该机制可使NMT系统能够通过双重学习游戏自动从未标记的数据中学习。该机制是受以下观察启发的:任何机器翻译任务都具有双重任务,例如,英语到法语的翻译(主要)与法语到英语的翻译(双重);即使没有人工标记,原始任务和双重任务也可以形成一个闭环,并生成有用的反馈信号来训练翻译模型。在双重学习机制中,我们使用一个代理来代表主要任务的模型,并使用另一个代理来代表双重任务的模型,然后要求他们通过强化学习过程互相教导。根据在此过程中生成的反馈信号(例如,模型输出的语言模型可能性以及原始翻译和双重翻译后原始句子的重构错误),我们可以迭代地更新两个模型,直到收敛为止(例如,使用策略梯度方法)。我们将神经机器翻译的相应方法称为双重NMT。实验表明,双重NMT在英语-法语翻译中效果很好。特别是,通过从单语数据(具有10%的双语数据进行热启动)中学习,它可以达到与从法语到英语翻译任务的全部双语数据训练出来的NMT相当的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号