...
首页> 外文期刊>Knowledge and Information Systems >Ranking-based evaluation of regression models
【24h】

Ranking-based evaluation of regression models

机译:基于排名的回归模型评估

获取原文
获取原文并翻译 | 示例
           

摘要

We suggest the use of ranking-based evaluation measures for regression models, as a complement to the commonly used residual-based evaluation. We argue that in some cases, such as the case study we present, ranking can be the main underlying goal in building a regression model, and ranking performance is the correct evaluation metric. However, even when ranking is not the contextually correct performance metric, the measures we explore still have significant advantages: They are robust against extreme outliers in the evaluation set; and they are interpretable. The two measures we consider correspond closely to non-parametric correlation coefficients commonly used in data analysis (Spearman's ρ and Kendall's τ); and they both have interesting graphical representations, which, similarly to ROC curves, offer useful various model performance views, in addition to a one-number summary in the area under the curve. An interesting extension which we explore is to evaluate models on their performance in “partially” ranking the data, which we argue can better represent the utility of the model in many cases. We illustrate our methods on a case study of evaluating IT Wallet size estimation models for IBM's customers.
机译:我们建议对回归模型使用基于排名的评估方法,以补充常用的基于残差的评估。我们认为,在某些情况下(例如我们目前的案例研究),排名可能是构建回归模型的主要基础目标,而排名表现是正确的评估指标。但是,即使排名不是根据上下文正确的绩效指标,我们探索的指标仍然具有显着的优势:对于评估集中的极端异常值,它们是可靠的;他们是可以解释的。我们认为这两个量度与数据分析中常用的非参数相关系数非常接近(斯皮尔曼的ρ和肯德尔的τ)。它们都具有有趣的图形表示,类似于ROC曲线,除了在曲线下方的区域中有一个数字摘要之外,还提供了有用的各种模型性能视图。我们探索的一个有趣的扩展是在“部分”排序数据时评估模型的性能,我们认为这可以更好地表示模型在许多情况下的效用。我们将在为IBM客户评估IT Wallet规模估计模型的案例研究中说明我们的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号