首页> 美国卫生研究院文献>other >A Tutorial on Rank-based Coefficient Estimation for Censored Data in Small- and Large-Scale Problems
【2h】

A Tutorial on Rank-based Coefficient Estimation for Censored Data in Small- and Large-Scale Problems

机译:基于秩的基于级别的系数估计的教程对小型和大规​​模问题进行了审查的数据

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

The analysis of survival endpoints subject to right-censoring is an important research area in statistics, particularly among econometricians and biostatisticians. The two most popular semiparametric models are the proportional hazards model and the accelerated failure time (AFT) model. Rank-based estimation in the AFT model is computationally challenging due to optimization of a non-smooth loss function. Previous work has shown that rank-based estimators may be written as solutions to linear programming (LP) problems. However, the size of the LP problem is O(n2 + p) subject to n2 linear constraints, where n denotes sample size and p denotes the dimension of parameters. As n and/or p increases, the feasibility of such solution in practice becomes questionable. Among data mining and statistical learning enthusiasts, there is interest in extending ordinary regression coefficient estimators for low-dimensions into high-dimensional data mining tools through regularization. Applying this recipe to rank-based coefficient estimators leads to formidable optimization problems which may be avoided through smooth approximations to non-smooth functions. We review smooth approximations and quasi-Newton methods for rank-based estimation in AFT models. The computational cost of our method is substantially smaller than the corresponding LP problem and can be applied to small- or large-scale problems similarly. The algorithm described here allows one to couple rank-based estimation for censored data with virtually any regularization and is exemplified through four case studies.
机译:右审查受到审查的生存终点的分析是统计数据的重要研究领域,特别是经济学家和生物统治者。两个最流行的半造型模型是比例危险模型和加速故障时间(AFT)模型。由于非平滑损耗功能的优化,AFT模型中基于秩的估计是在计算上具有挑战性的。以前的工作表明,基于秩的估计器可以写入线性编程(LP)问题的解决方案。然而,LP问题的大小是O(n 2 + p),受n 2 线性约束,其中n表示样本大小,p表示参数的尺寸。随着N和/或P增加,这种解决方案在实践中的可行性变得可疑。在数据挖掘和统计学习爱好者中,有兴趣通过正规化将低维层的普通回归系数估算扩展到高维数据挖掘工具中。将该配方应用于基于秩的系数估计,导致可强大的优化问题,可以通过平滑近似来避免到非平滑功能。我们审查了SHT模型中基于秩的估计的平滑近似和准牛顿方法。我们的方法的计算成本基本上小于相应的LP问题,并且可以类似地应用于小或大规模的问题。这里描述的算法允许将基于义的数据的估计与几乎任何正则化耦合到官方数据,并且通过四个案例研究示例。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号