Alternative Objective Functions for Training MT Evaluation Metrics

机译：培训MT评估指标的替代目标功能

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

MT evaluation metrics are tested for correlation with human judgments either at the sentence- or the corpus-level. Trained metrics ignore corpus-level judgments and are trained for high sentence-level correlation only. We show that training only for one objective (sentence or corpus level), can not only harm the performance on the other objective, but it can also be subopti-mal for the objective being optimized. To this end we present a metric trained for corpus-level and show empirical comparison against a metric trained for sentence-level exemplifying how their performance may vary per language pair, type and level of judgment. Subsequently we propose a model trained to optimize both objectives simultaneously and show that it is far more stable than-and on average outperformsboth models on both objectives.

机译：在句子或语料库级别测试MT评估指标是否与人类判断相关。训练有素的度量标准会忽略语料库级别的判断，而仅针对高句子级别的相关性进行训练。我们表明，仅针对一个目标（句子或语料库水平）进行训练，不仅会损害另一目标的性能，而且对于正在优化的目标而言可能不是最佳的。为此，我们提出了一种经过语料库训练的度量，并显示了与经过句子级训练的度量的经验比较，以说明他们的表现如何随语言对，判断类型和判断水平的变化而变化。随后，我们提出了一个训练有素的模型，该模型可以同时优化两个目标，并且显示出比两个目标都稳定得多的模型，并且平均而言，这两个模型的性能均优于两个模型。

著录项

来源
《Annual meeting of the Association for Computational Linguistics》|2017年|20-25|共6页
会议地点
作者
Milos Stanojevic; Khalil Simaan;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Subjective and objective evaluation of visual functions in dyslexic children with visual perceptual deficiency-Before and after ten weeks of perceptual training [J] . Leung Ka-Yan, Chan Henry Ho-Lung, Leung Mei-Po Research in developmental disabilities . 2018,第期

机译：具有视觉感知缺乏的障碍儿童视觉功能的主观和客观评估 - 感知训练前后十周之后
2. Alternative article-level metrics The use of alternative metrics in research evaluation [J] . Bornmann Lutz, Haunschild Robin EMBO reports . 2018,第12期

机译：替代文章级指标利用研究评估中的替代度量
3. New Metrics for Economic Evaluation in the Presence of Heterogeneity: Focusing on Evaluating Policy Alternatives Rather than Treatment Alternatives [J] . David D. Kim Medical decision making: An international journal of the Society for Medical Decision Making . 2017,第8期

机译：异质性存在的经济评估的新指标：重点关注评估政策替代方案而不是治疗方法
4. Alternative Objective Functions for Training MT Evaluation Metrics [C] . Milos Stanojevic, Khalil Simaan Annual meeting of the Association for Computational Linguistics . 2017

机译：培训MT评估指标的替代客观职能
5. Metrics to evaluate alternative watershed management policy outcomes using linear programming optimization and simulation of the Schuylkill River watershed in Southeastern Pennsylvania. [D] . Hesson, Molly D. 2013

机译：使用线性规划优化和宾夕法尼亚州东南部Schuylkill河分水岭的模拟评估替代分水岭管理政策成果的指标。
6. Severe limitations of the FEve metric of functional evenness and some alternative metrics [O] . Evsey Kosman, Samuel M. Scheiner, Hans‐Rolf Gregorius 2021

机译：功能性均匀度和一些替代度量的Feve度量的严重局限性
7. Alternative Objective Functions for Training MT Evaluation Metrics [O] . Miloš Stanojević, Khalil Simaan 2017

机译：培训MT评估指标的替代客观职能

Alternative Objective Functions for Training MT Evaluation Metrics

摘要

著录项

相似文献

相关主题

期刊订阅