首页> 美国卫生研究院文献>other >The Precision-Recall Plot Is More Informative than the ROC Plot When Evaluating Binary Classifiers on Imbalanced Datasets
【2h】

The Precision-Recall Plot Is More Informative than the ROC Plot When Evaluating Binary Classifiers on Imbalanced Datasets

机译:在不平衡数据集上评估二进制分类器时精确调用图比ROC图更具信息性

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Binary classifiers are routinely evaluated with performance measures such as sensitivity and specificity, and performance is frequently illustrated with Receiver Operating Characteristics (ROC) plots. Alternative measures such as positive predictive value (PPV) and the associated Precision/Recall (PRC) plots are used less frequently. Many bioinformatics studies develop and evaluate classifiers that are to be applied to strongly imbalanced datasets in which the number of negatives outweighs the number of positives significantly. While ROC plots are visually appealing and provide an overview of a classifier's performance across a wide range of specificities, one can ask whether ROC plots could be misleading when applied in imbalanced classification scenarios. We show here that the visual interpretability of ROC plots in the context of imbalanced datasets can be deceptive with respect to conclusions about the reliability of classification performance, owing to an intuitive but wrong interpretation of specificity. PRC plots, on the other hand, can provide the viewer with an accurate prediction of future classification performance due to the fact that they evaluate the fraction of true positives among positive predictions. Our findings have potential implications for the interpretation of a large number of studies that use ROC plots on imbalanced datasets.
机译:二进制分类器通常使用诸如灵敏度和特异性之类的性能指标进行评估,并且通常通过接收器工作特性(ROC)图来说明性能。诸如阳性预测值(PPV)和相关的精确度/召回率(PRC)图之类的替代量度则较少使用。许多生物信息学研究开发并评估了分类器,这些分类器将应用于严重失衡的数据集,其中负数要远远大于正数。尽管ROC图在外观上很吸引人,并提供了分类器在各种特定范围内的性能概述,但人们可以问ROC图在不平衡分类场景中使用时是否会产生误导作用。我们在这里表明,由于对特异性的直观而错误的解释,ROC图在不平衡数据集的上下文中的视觉可解释性对于分类性能的可靠性结论具有欺骗性。另一方面,PRC绘图可以为观看者提供对未来分类效果的准确预测,这是因为它们评估了阳性预测中真实阳性的比例。我们的发现对大量使用不平衡数据集上的ROC图的研究的解释具有潜在的意义。

著录项

  • 期刊名称 other
  • 作者

    Takaya Saito; Marc Rehmsmeier;

  • 作者单位
  • 年(卷),期 -1(10),3
  • 年度 -1
  • 页码 e0118432
  • 总页数 21
  • 原文格式 PDF
  • 正文语种
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号