...
首页> 外文期刊>Health services & outcomes research methodology >Two parts are better than one: modeling marginal means of semicontinuous data
【24h】

Two parts are better than one: modeling marginal means of semicontinuous data

机译:两部分优于一个:半连续数据的边际手段建模

获取原文
获取原文并翻译 | 示例
           

摘要

Abstract In health services research, it is common to encounter semicontinuous data characterized by a point mass at zero followed by a continuous distribution with positive support. These are often analyzed using two-part mixtures that separately model the probability of use to account for the portion of the sample with zero values. Commonly, but not always, the second component models the continuous values conditional on them being positive. Prior work examining whether such two-part models are needed to appropriately draw inference from semicontinuous data compared to standard one-part regression models has found mixed results. However, prior studies have generally used only measures of model fit on a single dataset, leaving a definitive conclusion uncertain. This paper provides a detailed evaluation using simulations of the appropriateness of standard one-part generalized linear models (GLMs) compared to a recently developed marginalized two-part (MTP) model. The MTP model, unlike the one-part GLMs, explicitly accounts for the point mass at zero, yet takes the same form for the marginal mean as the commonly used GLM with log link, making the covariate effects directly comparable. We simulate data scenarios with varying sample sizes and percentages of zeros. One-part GLMs resulted in increased bias, lower than nominal coverage of confidence intervals, and inflated type I error rates, rendering them inappropriate for use with semicontinuous data. Even when distributional assumptions were violated, estimates of covariate effects and type I error rates under the MTP model remained robust.
机译:摘要在卫生服务研究中,常见的是遇到零点质量的半连续数据,然后具有正载体的连续分布。这些通常使用两部分混合来分析这些混合物,其单独模拟用于将样本部分的使用概率进行零值。通常,但并不总是,第二组件模拟连续值,条件是正的。在与标准的单零件回归模型相比,检查是否需要从半连续数据施加推断的这种两部分模型的工作,发现了混合结果。然而,在单个数据集上仅使用模型适合的模型措施,留下了明确的结论。本文提供了与最近开发的边缘化两部分(MTP)模型相比,使用标准单件广泛的线性模型(GLMS)的适当性模拟的详细评估。与单件GLM不同,MTP模型明确地占点质量为零的,但对于具有日志链路的常用GLM,对边际平均值相同的形式,使得协变量直接可比较。我们模拟数据方案,具有不同的样本尺寸和零的百分比。单件GLM导致偏置增加,低于置信区间的标称覆盖率,并膨胀I型错误率,呈现不适合与半连续数据一起使用。即使在违反分配假设时,MTP模型下的协变量和I型错误率的估计仍然是强劲的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号