首页> 外文学位 >Robust model-based analysis of multivariate data with missing values.
【24h】

Robust model-based analysis of multivariate data with missing values.

机译:对缺失值的多元数据进行基于模型的鲁棒分析。

获取原文
获取原文并翻译 | 示例

摘要

The regression prediction method for inference from multivariate data with missing values is most useful when the covariates are predictive of the missing values and the probability of being missing. In these circumstances, predictions are particularly sensitive to model misspecification. One can make the model more robust and flexible using nonparametric regression. However, the high dimension of covariates leads to the "curse of dimensionality" problem. To address these problems, I propose a robust model-based method based on penalized splines (P-splines) of the response propensity scores. An estimator based on this method is called a propensity penalized spline prediction (PPSP) estimator. The key idea is to focus on correctly specifying the relationship between variables with missing values and propensity scores, since misspecification of this relationship leads to bias.; Valid estimates of variance of the PPSP estimator need to incorporate the added uncertainty due to nonresponse and propensity estimation. To account for the additional sources of variability, I develop methods based on the asymptotic variance, the bootstrap, and the multiple imputation.; The PPSP method is also extended to a general pattern of missing data by adapting and modifying Raghunathan et al.'s (2001) sequential regression multivariate imputation approach.; Simulation comparisons with other methods suggest that the proposed methods work well in a wide range of populations, with little loss of efficiency relative to parametric models when the latter are correct.
机译:当协变量可预测缺失值和缺失概率时,从具有缺失值的多元数据进行推断的回归预测方法中最有用。在这些情况下,预测对模型错误指定特别敏感。使用非参数回归可以使模型更健壮和灵活。但是,协变量的高维数导致“维数诅咒”问题。为了解决这些问题,我提出了基于响应倾向得分的惩罚样条(P-splines)的基于模型的鲁棒方法。基于此方法的估计器称为倾向惩罚样条预测(PPSP)估计器。关键思想是集中于正确指定具有缺失值和倾向得分的变量之间的关系,因为这种关系的错误指定会导致偏差。 PPSP估算器的有效方差估算需要考虑由于无响应和倾向性估算而增加的不确定性。为了说明可变性的其他来源,我开发了基于渐近方差,自举和多重插补的方法。通过修改和修改Raghunathan等人(2001年)的顺序回归多元插补方法,PPSP方法也扩展到丢失数据的一般模式。与其他方法的仿真比较表明,所提出的方法在各种人群中均能很好地工作,而当参数模型正确时,相对于参数模型而言,效率损失很小。

著录项

  • 作者

    An, Hyonggin.;

  • 作者单位

    University of Michigan.;

  • 授予单位 University of Michigan.;
  • 学科 Statistics.
  • 学位 Ph.D.
  • 年度 2004
  • 页码 99 p.
  • 总页数 99
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 统计学;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号