
On Using Very Large Target Vocabulary for Neural Machine Translation


Abstract

Neural machine translation, a recently proposed approach to machine translation based purely on neural networks, has shown promising results compared to the existing approaches such as phrase-based statistical machine translation. Despite its recent success, neural machine translation has a limitation in handling larger vocabularies, as training complexity as well as decoding complexity increase proportionally to the number of target words. In this paper, we propose a method based on importance sampling that allows us to use a very large target vocabulary without increasing training complexity. We show that decoding can be efficiently done even with the model having a very large target vocabulary by selecting only a small subset of the whole target vocabulary. The models trained by the proposed approach are empirically found to match, and in some cases outperform, the baseline models with a small vocabulary as well as the LSTM-based neural machine translation models. Furthermore, when we use an ensemble of a few models with very large target vocabularies, we achieve performance comparable to the state of the art (measured by BLEU) on both the English→German and English→French translation tasks of WMT'14.
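The core idea — normalizing the softmax over only a small sampled subset of the target vocabulary rather than all of it — can be sketched as follows. This is a minimal NumPy illustration under simplifying assumptions, not the authors' implementation: the paper partitions the training corpus and applies an importance-sampling correction under a proposal distribution, whereas this sketch draws a uniform sample for clarity. All function and variable names are hypothetical.

```python
import numpy as np

def sampled_softmax_nll(h, W, b, target, num_sampled, rng):
    """Negative log-likelihood of `target` with the softmax partition
    function computed over a small sampled subset of the vocabulary.

    h          : decoder hidden state, shape (d,)
    W, b       : output embedding matrix (V, d) and bias (V,)
    target     : index of the correct next word (always kept in the subset)
    num_sampled: size of the uniform negative sample (sketch only; the
                 paper uses an importance-sampling proposal instead)
    """
    V = W.shape[0]
    # Draw a subset of candidate words and force-include the true target.
    neg = rng.choice(V, size=num_sampled, replace=False)
    subset = np.unique(np.concatenate(([target], neg)))
    # Score only the subset: cost scales with |subset|, not with V.
    logits = W[subset] @ h + b[subset]
    logits = logits - logits.max()  # numerical stability
    log_z = np.log(np.exp(logits).sum())  # partition over the subset only
    target_pos = int(np.where(subset == target)[0][0])
    return log_z - logits[target_pos]
```

When `num_sampled` covers the whole vocabulary the subset softmax reduces to the exact full softmax, which is a useful sanity check; at decoding time the same trick applies by restricting the output layer to a candidate list built for the source sentence.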

