BLEU is the best-known automatic metric for assessing the performance of machine translation systems. However, BLEU cannot indicate which parts of an NMT system's output are good or bad. This paper describes an approach to automatically evaluating NMT systems by linguistic test points. The approach evaluates each linguistic test point individually, which BLEU cannot do, and provides intuitive insight into the strengths and weaknesses of NMT systems in handling various important linguistic phenomena. The evaluation used 58 linguistic test points consisting of 630 sentences. We evaluated two bidirectional English/Korean NMT systems. The BLEU scores of the English-to-Korean NMT systems were 0.0898 and 0.2081, and their automatic evaluation scores by linguistic test points were 58.35% and 77.31%, respectively. The BLEU scores of the Korean-to-English NMT systems were 0.3939 and 0.4512, and their automatic evaluation scores by linguistic test points were 33.10% and 40.47%, respectively. These results show that automatic evaluation by linguistic test points ranks the systems consistently with BLEU. The test-point evaluation further shows that both the English-to-Korean and Korean-to-English NMT systems are strong at translating polysemous words but weak at translating style and sentences with complex syntactic structures.
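The pass-rate percentages above suggest a per-test-point scoring scheme. A minimal sketch of such a scheme is given below; the matching rule (substring containment against a list of acceptable target-side phrases per test sentence) is an assumption for illustration, not the paper's exact method, and all names here are hypothetical.

```python
# Hypothetical sketch of evaluation by linguistic test points.
# Assumption: each test sentence carries a list of acceptable target-side
# phrases, and a translation passes if any acceptable phrase occurs in it.

def passes(translation: str, acceptable_phrases: list) -> bool:
    """A test sentence passes if any acceptable phrase occurs in the output."""
    return any(phrase in translation for phrase in acceptable_phrases)

def pass_rate(outputs: list, test_points: list) -> float:
    """Fraction of test sentences whose output matches an acceptable phrase."""
    hits = sum(passes(o, tp) for o, tp in zip(outputs, test_points))
    return hits / len(outputs)

# Toy example: two test sentences for a polysemy test point ("bank").
outputs = ["the bank of the river", "he went to the bank"]
test_points = [["bank of the river"], ["riverbank"]]
rate = pass_rate(outputs, test_points)  # first passes, second fails -> 0.5
```

Reporting `pass_rate` separately for each of the 58 test points would yield the kind of per-phenomenon breakdown (e.g. polysemy vs. complex syntax) that a single corpus-level BLEU score cannot provide.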