Journal: Computational Statistics & Data Analysis

Nonparametric estimation of the threshold at an operating point on the ROC curve



Abstract

In binary classification (or medical diagnosis), the classification rule (or diagnostic test) produces a continuous decision variable that is compared to a critical value (or threshold). Test values above the threshold are called positive for disease, and values below it are called negative. Every threshold value carries two types of error: Type I (false positive) and Type II (false negative). The Receiver Operating Characteristic (ROC) curve describes the relationship between the probabilities of these two types of error. The inverse problem is considered here: given the ROC curve (or its estimate) of a particular classification rule, find the threshold value that leads to a specific operating point on that curve. A nonparametric method for estimating the threshold is proposed, and the asymptotic distribution of the proposed estimator is derived. Results on simulated and real-world data are presented for finite sample sizes. Finding a particular threshold value is crucial in medical diagnosis, among other fields, where a medical test classifies a patient as "diseased" or "nondiseased" by comparing the test result to a particular threshold value. When the ROC curve is estimated, an operating point is obtained by fixing the probability of one type of error and reading the other from the estimated curve. Threshold estimation can then be viewed as quantile estimation for one distribution that also makes use of the second.
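To make the inverse problem concrete, the sketch below fixes the false positive rate and takes the matching empirical quantile of the nondiseased scores as the threshold, then reads the true positive rate off the diseased sample. It is a plain order-statistic baseline, not the estimator proposed in the paper, which also draws on the second distribution. The function names `threshold_at_fpr` and `operating_point`, the "strictly above the threshold is positive" convention, and the normal score distributions in the usage example are all assumptions made for illustration.

```python
import math
import random


def threshold_at_fpr(neg_scores, target_fpr):
    """Return an observed cutoff whose empirical FPR does not exceed target_fpr.

    Assumed convention: a score strictly above the threshold is called positive.
    """
    if not neg_scores:
        raise ValueError("neg_scores must not be empty")
    neg = sorted(neg_scores)
    n = len(neg)
    # Number of nondiseased scores allowed to fall above the threshold.
    # The small epsilon guards against floating-point undershoot in target_fpr * n.
    k = math.floor(target_fpr * n + 1e-12)
    if k >= n:
        return float("-inf")  # every score may be called positive
    # The (n - k)-th order statistic: at most k scores lie strictly above it.
    return neg[n - k - 1]


def operating_point(neg_scores, pos_scores, target_fpr):
    """Return (threshold, empirical FPR, empirical TPR) for a target FPR."""
    t = threshold_at_fpr(neg_scores, target_fpr)
    fpr = sum(s > t for s in neg_scores) / len(neg_scores)
    tpr = sum(s > t for s in pos_scores) / len(pos_scores)
    return t, fpr, tpr


if __name__ == "__main__":
    # Hypothetical simulated scores: nondiseased ~ N(0, 1), diseased ~ N(1.5, 1).
    rng = random.Random(0)
    neg = [rng.gauss(0.0, 1.0) for _ in range(500)]
    pos = [rng.gauss(1.5, 1.0) for _ in range(500)]
    t, fpr, tpr = operating_point(neg, pos, target_fpr=0.10)
    print(f"threshold={t:.3f}  FPR={fpr:.3f}  TPR={tpr:.3f}")
```

When scores are tied at the cutoff, the realized false positive rate can fall below the target, which is why the sketch reports the empirical rate rather than echoing the requested one.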
