Two parts are better than one: modeling marginal means of semicontinuous data

Valerie A. Smith; Brian Neelon; Matthew L. Maciejewski; John S. Preisser

首页> 外文期刊>Health services & outcomes research methodology >Two parts are better than one: modeling marginal means of semicontinuous data

【24h】

Two parts are better than one: modeling marginal means of semicontinuous data

机译：两部分优于一个：半连续数据的边际手段建模

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Abstract In health services research, it is common to encounter semicontinuous data characterized by a point mass at zero followed by a continuous distribution with positive support. These are often analyzed using two-part mixtures that separately model the probability of use to account for the portion of the sample with zero values. Commonly, but not always, the second component models the continuous values conditional on them being positive. Prior work examining whether such two-part models are needed to appropriately draw inference from semicontinuous data compared to standard one-part regression models has found mixed results. However, prior studies have generally used only measures of model fit on a single dataset, leaving a definitive conclusion uncertain. This paper provides a detailed evaluation using simulations of the appropriateness of standard one-part generalized linear models (GLMs) compared to a recently developed marginalized two-part (MTP) model. The MTP model, unlike the one-part GLMs, explicitly accounts for the point mass at zero, yet takes the same form for the marginal mean as the commonly used GLM with log link, making the covariate effects directly comparable. We simulate data scenarios with varying sample sizes and percentages of zeros. One-part GLMs resulted in increased bias, lower than nominal coverage of confidence intervals, and inflated type I error rates, rendering them inappropriate for use with semicontinuous data. Even when distributional assumptions were violated, estimates of covariate effects and type I error rates under the MTP model remained robust.

机译：摘要在卫生服务研究中，常见的是遇到零点质量的半连续数据，然后具有正载体的连续分布。这些通常使用两部分混合来分析这些混合物，其单独模拟用于将样本部分的使用概率进行零值。通常，但并不总是，第二组件模拟连续值，条件是正的。在与标准的单零件回归模型相比，检查是否需要从半连续数据施加推断的这种两部分模型的工作，发现了混合结果。然而，在单个数据集上仅使用模型适合的模型措施，留下了明确的结论。本文提供了与最近开发的边缘化两部分（MTP）模型相比，使用标准单件广泛的线性模型（GLMS）的适当性模拟的详细评估。与单件GLM不同，MTP模型明确地占点质量为零的，但对于具有日志链路的常用GLM，对边际平均值相同的形式，使得协变量直接可比较。我们模拟数据方案，具有不同的样本尺寸和零的百分比。单件GLM导致偏置增加，低于置信区间的标称覆盖率，并膨胀I型错误率，呈现不适合与半连续数据一起使用。即使在违反分配假设时，MTP模型下的协变量和I型错误率的估计仍然是强劲的。

著录项

来源
《Health services & outcomes research methodology》 |2017年第4期|共21页
作者
Valerie A. Smith; Brian Neelon; Matthew L. Maciejewski; John S. Preisser;
展开▼
作者单位

Center for Health Services Research in Primary Care (152) Durham VA Medical Center;

Department of Public Health Sciences Medical University of South Carolina;

Center for Health Services Research in Primary Care (152) Durham VA Medical Center;

Department of Biostatistics University of North Carolina;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类放射卫生;
关键词
Generalized gamma distribution; Health care expenditures; Log-skew-normal distribution; Marginalized models; Two-part models;

机译：广义伽玛分布;卫生保健支出;对数偏斜正态分布;边缘化模型;两部分模型;

相似文献

外文文献
中文文献
专利

1. Two-part models with stochastic processes for modelling longitudinal semicontinuous data: Computationally efficient inference and modelling the overall marginal mean [J] . Yiu Sean, Tom Brian D. M. Statistical methods in medical research . 2018,第12期

机译：具有随机化纵向半连续数据的随机过程的两部分模型：计算高效推论和建模整体边际平均值
2. A marginalized two-part model with heterogeneous variance for semicontinuous data [J] . Smith Valerie A., Preisser John S. Statistical methods in medical research . 2019,第5期

机译：边缘化的两部分模型，具有半连续数据的异构方差
3. Analysis of longitudinal semicontinuous data using marginalized two-part model [J] . Miran A. Jaffa, Mulugeta Gebregziabher, Sara M. Garrett, Journal of Translational Medicine . 2018,第1期

机译：使用边缘化两部分模型分析纵向半连续数据
4. Approximate Calculation of Marginal Association Probabilities using a Hybrid Data Association Model [C] . Marcus Baum, Peter Willett, Yaakov Bar-Shalom, Conference on signal and data processing of small targets . 2014

机译：使用混合数据关联模型近似计算边际关联概率
5. Two Part Random Effect Models for Semicontinuous Data with Application to Toenail Data. [D] . Mian, Md. Rajibul Islam. 2012

机译：半连续数据的两部分随机效应模型及其在趾甲数据中的应用。
6. Two-part models with stochastic processes for modelling longitudinal semicontinuous data: Computationally efficient inference and modelling the overall marginal mean [O] . Sean Yiu, Brian DM Tom -1

机译：由两部分组成的具有随机过程的模型用于对纵向半连续数据进行建模：计算有效的推理和总体边际均值建模
7. Two-part models with stochastic processes for modelling longitudinal semicontinuous data: computationally efficient inference and modelling the overall marginal mean [O] . Yiu, Sean, Tom, Brian 2017

机译：具有随机过程的两部分模型，用于纵向建模半连续数据：计算有效的推理和建模总体边际均值

Two parts are better than one: modeling marginal means of semicontinuous data

摘要

著录项

相似文献

相关主题

期刊订阅