首页> 外文学位 >Missing data methods for clustered longitudinal data.
【24h】

Missing data methods for clustered longitudinal data.

机译:集群纵向数据的缺失数据方法。

获取原文
获取原文并翻译 | 示例

摘要

Recently medical and public health research has focused on the development of models for longitudinal studies that aim to identify individuals at risk for obesity by tracking childhood weight gain. The National Longitudinal Surveys of Youth 79 (NLSY79), which includes a random sample of women with biometric information on their biological children collected biennially, has been considered. A mixed model with three levels of clustered random effects has been proposed for the estimation of child-specific weight trajectories while accounting the nested structure of the dataset. Included in this model is a regression equation approach to address any remaining heterogeneity in the within-child variances. Specifically, a model has been implemented to fit the log of the within-child variances as a function of time. This allows for more flexibility in modeling residual variances that appear to be increasing over time. Using the EM algorithm with a Newton-Raphson update all the parameters of the model are estimated simultaneously.;A second aspect to the research that is presented is the analysis of missing data. Extensive exploratory analysis revealed that intermittent missingness was prevalent in the relevant subset of the NLSY79 dataset. Starting with the assumptions of MCAR and MAR selection models are built up to appropriately account for the missing mechanism at play. A factorization of the multinomial distribution as a product of dependent binary observations is applied to model intermittent missingness. Logit models for dependent binary observations are used to fit selection models for missingness under the assumptions of MAR and MCAR. The NMAR case for clustered longitudinal data is discussed as an area for future research.
机译:最近,医学和公共卫生研究集中于纵向研究模型的开发,该模型旨在通过跟踪儿童的体重增加来识别有肥胖风险的个体。已经考虑了《全国青年纵向调查79》(NLSY79),其中包括一个随机样本的妇女,每两年一次收集有关其生子女的生物特征信息。已经提出了具有三个级别的聚类随机效应的混合模型,用于在考虑数据集的嵌套结构的同时估计儿童特定的体重轨迹。该模型包括一种回归方程方法,以解决子代内方差中任何剩余的异质性。具体而言,已经实现了一种模型,以拟合作为时间函数的子内部方差的对数。这为建模随时间增加的残差方差提供了更大的灵活性。将EM算法与Newton-Raphson更新一起使用,可以同时估计模型的所有参数。;提出的第二个方面是对缺失数据的分析。广泛的探索性分析显示,间歇性缺失在NLSY79数据集的相关子集中非常普遍。首先从建立MCAR和MAR选择模型的假设出发,以适当考虑游戏中的缺失机制。将多项式分布的因式分解作为相关二元观测值的乘积应用于模型间歇性缺失。在MAR和MCAR的假设下,用于相关二元观测的Logit模型用于拟合缺失模型。讨论了纵向数据聚类的NMAR案例,作为未来研究的领域。

著录项

  • 作者

    Modur, Sharada.;

  • 作者单位

    The Ohio State University.;

  • 授予单位 The Ohio State University.;
  • 学科 Biology Biostatistics.;Statistics.
  • 学位 Ph.D.
  • 年度 2010
  • 页码 164 p.
  • 总页数 164
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号