首页> 外文期刊>EURASIP journal on bioinformatics and systems biology >Impact of Missing Value Imputation on Classification for DNA Microarray Gene Expression Data—A Model-Based Study
【24h】

Impact of Missing Value Imputation on Classification for DNA Microarray Gene Expression Data—A Model-Based Study

机译:缺失值插值对DNA微阵列基因表达数据分类的影响-基于模型的研究

获取原文
           

摘要

Many missing-value (MV) imputation methods have been developed for microarray data, but only a few studies have investigated the relationship between MV imputation and classification accuracy. Furthermore, these studies are problematic in fundamental steps such as MV generation and classifier error estimation. In this work, we carry out a model-based study that addresses some of the issues in previous studies. Six popular imputation algorithms, two feature selection methods, and three classification rules are considered. The results suggest that it is beneficial to apply MV imputation when the noise level is high, variance is small, or gene-cluster correlation is strong, under small to moderate MV rates. In these cases, if data quality metrics are available, then it may be helpful to consider the data point with poor quality as missing and apply one of the most robust imputation algorithms to estimate the true signal based on the available high-quality data points. However, at large MV rates, we conclude that imputation methods are not recommended. Regarding the MV rate, our results indicate the presence of a peaking phenomenon: performance of imputation methods actually improves initially as the MV rate increases, but after an optimum point, performance quickly deteriorates with increasing MV rates.
机译:已经为微阵列数据开发了许多缺失值(MV)插补方法,但只有少数研究调查了MV插补与分类准确性之间的关系。此外,这些研究在诸如MV产生和分类器误差估计的基本步骤中是有问题的。在这项工作中,我们进行了基于模型的研究,以解决先前研究中的一些问题。考虑了六种流行的归因算法,两种特征选择方法和三种分类规则。结果表明,在中低MV速率下,当噪声水平高,方差小或基因-簇相关性强时,应用MV插补是有益的。在这些情况下,如果数据质量指标可用,则将质量较差的数据点视为丢失并应用最可靠的插补算法之一基于可用的高质量数据点来估计真实信号可能会有所帮助。但是,在较大的MV速率下,我们得出结论,不建议使用插补方法。关于MV速率,我们的结果表明存在峰值现象:插值方法的性能实际上随着MV速率的增加而开始改善,但是在最佳点之后,性能随MV速率的增加而迅速恶化。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号