Workshop on Asian Translation (WAT 2017)

Patent NMT integrated with Large Vocabulary Phrase Translation by SMT at WAT 2017



Abstract

Neural machine translation (NMT) cannot handle a large vocabulary because training and decoding complexity increase in proportion to the number of target words. This problem becomes even more serious when translating patent documents, which contain many technical terms that occur only infrequently. Long et al. (2017) proposed selecting phrases that contain out-of-vocabulary words using the statistical approach of branching entropy. The selected phrases are then replaced with tokens during training and post-translated using the phrase translation table of SMT. In this paper, we apply the method of Long et al. (2017) to the WAT 2017 Japanese-Chinese and Japanese-English patent datasets. Evaluation on Japanese-to-Chinese, Chinese-to-Japanese, Japanese-to-English, and English-to-Japanese patent sentence translation demonstrates the effectiveness of phrases selected with branching entropy: the NMT model of Long et al. (2017) achieves a substantial improvement over a baseline NMT model trained without the proposed technique.
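The phrase-selection criterion described above can be illustrated with a minimal sketch of right branching entropy: the entropy of the distribution over tokens that immediately follow a candidate phrase in a corpus. A phrase boundary is plausible where this entropy is high (many different continuations). This is a simplified illustration of the statistic, not Long et al.'s implementation; the function name and corpus representation are assumptions.

```python
import math
from collections import Counter

def branching_entropy(corpus, ngram):
    """Right branching entropy of `ngram` (a tuple of tokens):
    entropy, in bits, of the distribution over tokens that
    immediately follow the n-gram in the tokenized corpus."""
    n = len(ngram)
    followers = Counter()
    for sent in corpus:
        for i in range(len(sent) - n):
            if tuple(sent[i:i + n]) == ngram:
                followers[sent[i + n]] += 1
    total = sum(followers.values())
    if total == 0:
        return 0.0  # n-gram never seen with a successor
    return -sum((c / total) * math.log2(c / total)
                for c in followers.values())

# Toy corpus: "neural" is always followed by "network" (entropy 0,
# so the phrase likely continues), while "network" has varied
# successors (higher entropy, a plausible phrase boundary).
corpus = [
    ["neural", "network", "training"],
    ["neural", "network", "decoding"],
    ["a", "neural", "network"],
]
print(branching_entropy(corpus, ("neural",)))   # 0.0
print(branching_entropy(corpus, ("network",)))  # 1.0
```

In the full method, phrases whose boundaries score high under this statistic and that contain out-of-vocabulary words are replaced with placeholder tokens before NMT training, and the placeholders are later post-translated with an SMT phrase table.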


