Dual Learning for Machine Translation

Abstract

While neural machine translation (NMT) has made good progress in the past two years, its training requires tens of millions of bilingual sentence pairs, and human labeling is very costly. To tackle this training data bottleneck, we develop a dual-learning mechanism that enables an NMT system to learn automatically from unlabeled data through a dual-learning game. The mechanism is inspired by the following observation: any machine translation task has a dual task, e.g., English-to-French translation (primal) versus French-to-English translation (dual); the primal and dual tasks can form a closed loop and generate informative feedback signals to train the translation models, even without the involvement of a human labeler. In the dual-learning mechanism, we use one agent to represent the model for the primal task and another agent to represent the model for the dual task, then ask them to teach each other through a reinforcement learning process. Based on the feedback signals generated during this process (e.g., the language-model likelihood of a model's output, and the reconstruction error of the original sentence after the primal and dual translations), we can iteratively update the two models until convergence (e.g., using policy gradient methods). We call the corresponding approach to neural machine translation dual-NMT. Experiments show that dual-NMT works very well on English↔French translation; in particular, by learning from monolingual data (with 10% of the bilingual data for warm start), it achieves accuracy comparable to NMT trained on the full bilingual data for the French-to-English translation task.
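
The loop described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the single-token "sentences", the tabular softmax models `theta_ab` and `theta_ba`, the fixed language model `lm_b`, the mixing weight `alpha`, and the learning rate `lr` are all assumptions made for illustration. The primal agent samples a translation, the two feedback signals named in the abstract are combined into one reward, and both models are updated with a policy-gradient-style step.

```python
import math
import random


def softmax(logits):
    """Turn a list of logits into a probability distribution."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def combined_reward(lm_logp, rec_logp, alpha):
    """Mix language-model and reconstruction feedback (alpha is an assumed weight)."""
    return alpha * lm_logp + (1.0 - alpha) * rec_logp


def dual_step(theta_ab, theta_ba, lm_b, a, alpha=0.5, lr=0.1, rng=random):
    """One round of the game for a single toy 'sentence' a (one token id).

    theta_ab[a] holds the logits of the toy primal model P(b | a).
    theta_ba[b] holds the logits of the toy dual model P(a | b).
    lm_b[b] is a fixed toy language-model probability for token b.
    """
    # The primal agent samples a translation b ~ P(. | a).
    p_ab = softmax(theta_ab[a])
    b = rng.choices(range(len(p_ab)), weights=p_ab)[0]

    # Feedback: fluency of b, and how well the dual agent recovers a from b.
    p_ba = softmax(theta_ba[b])
    reward = combined_reward(math.log(lm_b[b]), math.log(p_ba[a]), alpha)

    # REINFORCE update for the primal model: reward * grad log P(b | a).
    for j in range(len(p_ab)):
        grad = (1.0 if j == b else 0.0) - p_ab[j]
        theta_ab[a][j] += lr * reward * grad

    # Dual model: ascend the reconstruction log-likelihood log P(a | b).
    for i in range(len(p_ba)):
        grad = (1.0 if i == a else 0.0) - p_ba[i]
        theta_ba[b][i] += lr * (1.0 - alpha) * grad

    return b, reward


if __name__ == "__main__":
    rng = random.Random(0)
    k = 3  # toy vocabulary size for both languages
    theta_ab = [[0.0] * k for _ in range(k)]
    theta_ba = [[0.0] * k for _ in range(k)]
    lm_b = [1.0 / k] * k
    for _ in range(200):
        dual_step(theta_ab, theta_ba, lm_b, rng.randrange(k), rng=rng)
```

Because the rewards here are log-probabilities, they are always negative; practical policy-gradient setups commonly subtract a baseline to reduce variance, which this sketch omits for brevity.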

Bibliographic details

Source
Annual Conference on Neural Information Processing Systems | 2016 | pp. 820-828 | 9 pages
Authors
Di He; Yingce Xia; Tao Qin; Liwei Wang; Nenghai Yu; Tie-Yan Liu; Wei-Ying Ma
Original format
PDF
