A simple and fast alternative to the EM algorithm for incomplete categorical data and latent class models

Andrzej T. Galecki; Thomas R. Ten Have; Geert Molenberghs


Abstract

Incomplete categorical data and latent class models play an important role in the biostatistical and medical literature. The most common maximum likelihood procedure for fitting these types of models is the EM algorithm. We present a faster alternative to these EM approaches that improves upon a recently introduced maximum likelihood-based alternative by Molenberghs and Goetghebeur (MG) (1997. J. Roy. Statist. Soc. Ser. B 59, 401-414) in two ways: by accommodating higher-dimensional problems via more time points in longitudinal problems, and by employing an iteratively reweighted least-squares (IRLS) approach that is less tedious than the Newton-Raphson procedure used by MG. This IRLS approach will also facilitate the potential extension to models with random effects in the context of complete and incomplete categorical data and latent classes. As with the MG approach, we maximize the observed likelihood instead of the complete data likelihood under a multivariate generalized logistic model with composite link function. This results in a faster convergence rate than the EM algorithm and makes variance estimates easy to obtain. We illustrate the proposed estimation procedure with a latent class application, using data from an HIV study involving four dichotomous test measures on each individual and assuming a latent class disease variable with two levels.
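
To make the phrase "observed likelihood" concrete, the following is a minimal Python sketch of the objective for a two-class latent class model with four binary tests. It is not the authors' IRLS procedure and does not use the HIV data: the parameterization (logits for prevalence and test positivity), the conditional-independence assumption, the function names, and all numeric values are assumptions made for illustration only. The observed cell probabilities are obtained by summing the complete-data probabilities over the unobserved class, which is the collapsing step that a composite link encodes.

```python
import math
from itertools import product


def sigmoid(x):
    """Inverse logit."""
    return 1.0 / (1.0 + math.exp(-x))


def observed_cell_probs(theta):
    """Return P(y) for the 16 response patterns of four binary tests.

    theta (hypothetical layout) =
        [logit of class-1 prevalence,
         4 logits of P(test positive | class 0),
         4 logits of P(test positive | class 1)]
    Tests are assumed conditionally independent given the latent class.
    """
    prevalence = sigmoid(theta[0])
    class_weights = [1.0 - prevalence, prevalence]
    positive_prob = [
        [sigmoid(t) for t in theta[1:5]],
        [sigmoid(t) for t in theta[5:9]],
    ]
    probs = {}
    for pattern in product((0, 1), repeat=4):
        total = 0.0
        for c in (0, 1):
            joint = class_weights[c]
            for j, y in enumerate(pattern):
                p = positive_prob[c][j]
                joint *= p if y == 1 else 1.0 - p
            total += joint  # collapse over the unobserved class
        probs[pattern] = total
    return probs


def observed_loglik(theta, counts):
    """Multinomial log-likelihood of the observed 2^4 table (up to a constant)."""
    probs = observed_cell_probs(theta)
    return sum(n * math.log(probs[pat]) for pat, n in counts.items() if n > 0)


# Illustrative values only: expected counts generated from theta_true.
theta_true = [-1.0, -2.0, -2.5, -1.5, -3.0, 2.0, 1.5, 2.5, 1.0]
expected_counts = {
    pat: 400 * p for pat, p in observed_cell_probs(theta_true).items()
}
theta_other = [0.0] * 9  # every pattern equally likely

print(observed_loglik(theta_true, expected_counts))
print(observed_loglik(theta_other, expected_counts))
```

The sketch stops at the objective. Any general-purpose optimizer could be applied to `observed_loglik` directly, without alternating E and M steps over the complete-data likelihood; the IRLS scheme described in the abstract is one such direct approach and is not reproduced here.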


Bibliographic details

Source
Computational Statistics & Data Analysis | 2001, Issue 3 | 17 pages
Authors
Andrzej T. Galecki; Thomas R. Ten Have; Geert Molenberghs
Original format: PDF
Language: English
Chinese Library Classification: Computing technology, computer technology
Keywords
categorical data; multivariate marginal logistic models; latent class models; incomplete data; coarsening


Similar literature

1. A simple and fast alternative to the EM algorithm for incomplete categorical data and latent class models [J]. Andrzej T. Galecki, Thomas R. Ten Have, Geert Molenberghs. Computational Statistics & Data Analysis. 2001, No. 3
2. An alternative to classical latent class models selection methods for sparse binary data: an illustration with simulated data [J]. Araya Alpízar Carlomagno. Revista de Matemática Teoría y Aplicaciones. 2016, No. 1
3. Latent class models for multiple ordered categorical health data: testing violation of the local independence assumption [J]. Li Donni Paolo, Thomas Ranjeeta. Empirical Economics. 2020, No. 4
4. Fast (Incremental) Algorithms for Useful Classes of Simple Temporal Problems with Preferences [C]. T. K. Satish Kumar. Twentieth International Joint Conference on Artificial Intelligence (IJCAI-07). 2007
5. A Comparison of Sixteen Classification Strategies of Rule Induction from Incomplete Data Using the MLEM2 Algorithm [D]. Nelakurthi, Venkata Siva Pavan Kumar Kumar. 2020
6. Bayesian Multilevel Latent Class Models for the Multiple Imputation of Nested Categorical Data [O]. Davide Vidotto, Jeroen K. Vermunt, Katrijn van Deun
7. 9 MULTIPLE IMPUTATION OF INCOMPLETE CATEGORICAL DATA USING LATENT CLASS ANALYSIS [O]. Jeroen K. Vermunt, Joost R. Van Ginkel, L. Andries Van Der Ark. 2015
