Predicting COPD status with a random generalized linear model

Lin Song; Steve Horvath

首页> 外文期刊>Systems biomedicine. >Predicting COPD status with a random generalized linear model

【24h】

Predicting COPD status with a random generalized linear model

机译：使用随机广义线性模型预测COPD状态

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Sample classification, especially disease status prediction, is an important area of investigation for gene expression studies. Many machine learning methods have been developed to tackle this problem. To evaluate different prediction methods, the IMPROVER Challenge made several data sets available. Here we focus on one sub-challenge: chronic obstructive pulmonary disease (COPD). We outlined critical preprocessing steps to make training and test data comparable. We compared our recently introduced random generalized linear model (RGLM) predictor with Leo Breiman’s random forest (RF) predictor on the COPD data set. We discussed potential reasons for the superior performance of the RGLM predictor in this sub-challenge. Interestingly, we found that although several genes were highly predictive of COPD status, none were necessary to achieve accurate prediction when demographic features smoking status and age were used. In conclusion, RGLM achieved superior predictive accuracy for predicting COPD status with smoking status and age as mandatory features. Future cohort studies could evaluate whether the resulting predictor has clinical utility.

机译：样本分类，尤其是疾病状态预测，是基因表达研究的重要研究领域。已经开发了许多机器学习方法来解决这个问题。为了评估不同的预测方法，IMPROVER挑战赛提供了多个数据集。在这里，我们集中于一项子挑战：慢性阻塞性肺疾病（COPD）。我们概述了关键的预处理步骤，以使培训和测试数据具有可比性。我们在COPD数据集上比较了最近推出的随机广义线性模型（RGLM）预测器和Leo Breiman的随机森林（RF）预测器。我们讨论了在此子挑战中RGLM预测器具有出色性能的潜在原因。有趣的是，我们发现，尽管有几个基因可以高度预测COPD的状况，但是当使用人口统计学特征吸烟状况和年龄时，对于准确预测COPD而言，没有一个基因是必需的。总之，RGLM在以吸烟状况和年龄为强制特征来预测COPD状况方面获得了卓越的预测准确性。未来的队列研究可以评估所得的预测指标是否具有临床实用性。

著录项

来源
《Systems biomedicine.》 |2013年第4期|共7页
作者
Lin Song; Steve Horvath;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类基础医学;
关键词

相似文献

外文文献
中文文献
专利

1. Modeling Linguistic Variables With Regression Models: Addressing Non-Gaussian Distributions, Non-independent Observations, and Non-linear Predictors With Random Effects and Generalized Additive Models for Location, Scale, and Shape [J] . Christophe Coup?? Frontiers in Psychology . 2018,第1期

机译：使用回归模型为语言变量建模：处理具有随机效应的非高斯分布，非独立观测值和非线性预测变量以及位置，尺度和形状的广义加性模型
2. Random generalized linear model: a highly accurate and interpretable ensemble predictor [J] . Lin Song, Peter Langfelder, Steve Horvath BMC Bioinformatics . 2013,第1期

机译：随机广义线性模型：高度准确且可解释的整体预测器
3. Power analysis for cluster randomized trials with binary outcomes modeled by generalized linear mixed-effects models [J] . Chen T., Lu N., Arora J., Journal of applied statistics . 2016,第5a8期

机译：用广义线性混合效应模型建模的二元结果的集群随机试验的功效分析
4. SPARSE GENERALIZED FUNCTIONAL LINEAR MODEL FOR PREDICTING REMISSION STATUS OF DEPRESSION PATIENTS [C] . YASHU LIU, ZHI NIE, JIAYU ZHOU, Pacific Symposium on Biocomputing . 2014

机译：预测抑郁症患者缓解状态的稀疏广义功能线性模型
5. Effect of qigong on physical and psychosocial status of Chinese COPD patients: A randomized controlled trial [D] . Ng, Hin Po Bobby 2010

机译：气功对中国慢性阻塞性肺病患者身心健康状况的影响：一项随机对照试验
6. Modeling Linguistic Variables With Regression Models: Addressing Non-Gaussian Distributions Non-independent Observations and Non-linear Predictors With Random Effects and Generalized Additive Models for Location Scale and Shape [O] . Christophe Coupé -1

机译：使用回归模型为语言变量建模：处理具有随机效应的非高斯分布非独立观测值和非线性预测变量以及位置尺度和形状的广义加性模型
7. Predicting COPD status with a random generalized linear model [O] . Lin Song, Steve Horvath 2013

机译：预测随机广义线性模型的COPD状态

Predicting COPD status with a random generalized linear model

摘要

著录项

相似文献

相关主题

期刊订阅