The aim of this paper is to investigate suitable evaluation strategies for word-level quality estimation of machine translation. We suggest various metrics to replace the F_1-score for the "BAD" class, which is currently used as the main evaluation metric. We compare the metrics' performance on real system outputs and on synthetically generated datasets, and suggest a reliable alternative to the F_1-BAD score: the multiplication of the F_1-scores for the different classes. Other metrics have lower discriminative power and are biased by unfair labellings.
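The proposed metric, the product of per-class F_1-scores, can be sketched as follows (a minimal illustration; the function name and the binary OK/BAD label set are assumptions for this example, not code from the paper):

```python
def f1_mult(y_true, y_pred, labels=("OK", "BAD")):
    """Multiply per-class F1-scores over the given label set.

    A per-class F1 of zero (e.g. a degenerate all-"OK" prediction)
    drives the whole product to zero, which is what makes the metric
    robust to such trivial labellings.
    """
    product = 1.0
    for label in labels:
        tp = sum(t == p == label for t, p in zip(y_true, y_pred))
        n_pred = sum(p == label for p in y_pred)
        n_true = sum(t == label for t in y_true)
        precision = tp / n_pred if n_pred else 0.0
        recall = tp / n_true if n_true else 0.0
        f1 = (2 * precision * recall / (precision + recall)
              if precision + recall else 0.0)
        product *= f1
    return product


# Example: one "OK" token mislabelled as "BAD".
gold = ["OK", "OK", "BAD", "BAD"]
pred = ["OK", "BAD", "BAD", "BAD"]
print(f1_mult(gold, pred))  # F1_OK * F1_BAD = (2/3) * (4/5) = 8/15
```

Note that, unlike F_1-BAD alone, this product cannot be inflated by over-predicting a single class: a classifier that labels every token "BAD" gets F_1-OK = 0 and therefore a score of 0.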