首页> 外文期刊>Computational statistics & data analysis >A simple and fast alternative to the EM algorithm for incomplete categorical data and latent class models
【24h】

A simple and fast alternative to the EM algorithm for incomplete categorical data and latent class models

机译:不完整分类数据和潜在类模型的简单快速替代EM算法的方法

获取原文
获取原文并翻译 | 示例
           

摘要

Incomplete categorical data and latent class models play an important role in biostatistical and medical literature. The most common maximum likelihood procedure for accommodating these types of models is the EM algorithm. We present a faster alternative to these EM approaches that improves upon a recently introduced maximum likelihood-based alternative by Molenberghs and Goetghebeur (1997. J. Roy. Statist. Soc. Ser. B 59, 401-414) in two ways: by accommodating higher-dimensional problems via more time points in longitudinal problems and by employing a less tedious iteratively reweighted least-squares (IRLS) approach than the Newton-Raphson procedure used by MG. This IRLS approach also will facilitate the potential extension to models with random effects in the context of complete and incomplete categorical data and latent classes. We illustrate our method with a latent class application. As with the MG approach, we maximize the observed likelihood instead of the complete data likelihood under a multivariate generalized logistic model with composite link function. This results in a faster convergence rate than the EM algorithm, and allowing easily obtainable variance estimates. We illustrate the proposed estimation procedure using data from an HIV study involving four dichotomous test measures on each individual, assuming a latent class disease variable with two levels.
机译:不完整的分类数据和潜在类别模型在生物统计学和医学文献中起着重要作用。适应这些类型的模型的最常见的最大似然过程是EM算法。我们提出了这些EM方法的一种更快的替代方法,它通过两种方式在Molenberghs和Goetghebeur(1997. J. Roy。Statist。Soc。Ser。B 59,401-414)最近引入的基于最大似然性的替代方法上进行了改进。通过解决纵向问题的更多时间点,并采用比MG使用的Newton-Raphson过程少的乏味的迭代最小加权平方(IRLS)方法,可以解决高维问题。在完整和不完整的分类数据和潜在类别的情况下,这种IRLS方法还将有助于潜在扩展具有随机效应的模型。我们用一个潜在的类应用程序来说明我们的方法。与MG方法一样,在具有复合链接函数的多元广义logistic模型下,我们将观察到的似然性最大化,而不是使完整数据似然性最大化。这导致比EM算法更快的收敛速度,并允许轻松获得方差估计。我们使用来自一项HIV研究的数据来说明建议的估算程序,该数据涉及针对每个个体的四种二分测试方法,并假设潜在疾病类别变量具有两个水平。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号