首页> 美国卫生研究院文献>other >Assessing the influence of rater and subject characteristics on measures of agreement for ordinal ratings
【2h】

Assessing the influence of rater and subject characteristics on measures of agreement for ordinal ratings

机译:评估评分者和主题特征对序数评分的一致性度量的影响

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Widespread inconsistencies are commonly observed between physicians’ ordinal classifications in screening tests results such as mammography. These discrepancies have motivated large-scale agreement studies where many raters contribute ratings. The primary goal of these studies is to identify factors related to physicians and patients’ test results which may lead to stronger consistency between raters’ classifications. While ordered categorical scales are frequently used to classify screening test results, very few statistical approaches exist to model agreement between multiple raters. Here we develop a flexible and comprehensive approach to assess the influence of rater and subject characteristics on agreement between multiple raters’ ordinal classifications in large-scale agreement studies. Our approach is based upon the class of generalized linear mixed models. Novel summary model-based measures are proposed to assess agreement between all, or a subgroup of raters, such as experienced physicians. Hypothesis tests are described to formally identify factors such as physicians’ level of experience that play an important role in improving consistency of ratings between raters. We demonstrate how unique characteristics of individual raters can be assessed via conditional modes generated during the modeling process. Simulation studies are presented to demonstrate the performance of the proposed methods and summary measure of agreement. The methods are applied to a large-scale mammography agreement study to investigate the effects of rater and patient characteristics on the strength of agreement between radiologists.
机译:通常在乳腺X线摄影等筛查测试结果中,医生按序数分类之间会发现广泛的不一致之处。这些差异激发了大规模的协议研究,其中许多评估者贡献了评级。这些研究的主要目的是确定与医师和患者的检测结果有关的因素,这些因素可能导致评分者的分类之间更加一致。虽然有序分类量表经常用于对筛查测试结果进行分类,但很少有统计方法可以对多个评估者之间的一致性进行建模。在这里,我们开发了一种灵活而全面的方法来评估大规模协议研究中评估者和主题特征对多个评估者序数分类之间的一致性的影响。我们的方法基于广义线性混合模型的类别。提出了基于新的基于概要模型的度量,以评估所有或部分评估者(例如经验丰富的医生)之间的一致性。假设检验旨在正式识别诸如医生的经验水平等因素,这些因素在提高评估者之间的评分一致性方面起着重要作用。我们演示了如何通过建模过程中生成的条件模式来评估各个评估者的独特特征。仿真研究表明了所提出的方法的性能和协议的简易性。该方法被应用于大规模的乳腺X线摄影协议研究,以研究评估者和患者特征对放射线医师之间协议强度的影响。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号