首页> 外文会议>Annual meeting of the Association for Computational Linguistics >Alternative Objective Functions for Training MT Evaluation Metrics
【24h】

Alternative Objective Functions for Training MT Evaluation Metrics

机译:培训MT评估指标的替代目标功能

获取原文

摘要

MT evaluation metrics are tested for correlation with human judgments either at the sentence- or the corpus-level. Trained metrics ignore corpus-level judgments and are trained for high sentence-level correlation only. We show that training only for one objective (sentence or corpus level), can not only harm the performance on the other objective, but it can also be subopti-mal for the objective being optimized. To this end we present a metric trained for corpus-level and show empirical comparison against a metric trained for sentence-level exemplifying how their performance may vary per language pair, type and level of judgment. Subsequently we propose a model trained to optimize both objectives simultaneously and show that it is far more stable than-and on average outperformsboth models on both objectives.
机译:在句子或语料库级别测试MT评估指标是否与人类判断相关。训练有素的度量标准会忽略语料库级别的判断,而仅针对高句子级别的相关性进行训练。我们表明,仅针对一个目标(句子或语料库水平)进行训练,不仅会损害另一目标的性能,而且对于正在优化的目标而言可能不是最佳的。为此,我们提出了一种经过语料库训练的度量,并显示了与经过句子级训练的度量的经验比较,以说明他们的表现如何随语言对,判断类型和判断水平的变化而变化。随后,我们提出了一个训练有素的模型,该模型可以同时优化两个目标,并且显示出比两个目标都稳定得多的模型,并且平均而言,这两个模型的性能均优于两个模型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号