...
首页> 外文期刊>Computational statistics & data analysis >A simple method for screening variables before clustering microarray data
【24h】

A simple method for screening variables before clustering microarray data

机译:在聚类微阵列数据之前筛选变量的简单方法

获取原文
获取原文并翻译 | 示例
           

摘要

A simple and computationally fast procedure is proposed for screening a large number of variables prior to cluster analysis. Each variable is considered in turn, the sample is divided into the two groups that maximise the ratio of between-group to within-group sum of squares for that variable, and the achieved value of this ratio is tested to see if it is significantly greater than what would be expected when partitioning a sample from a single homogeneous population. Those variables that achieve significance are then used in the cluster analysis. It is suggested that significance levels be assessed using a Monte Carlo computational procedure; by assuming within-group normality an analytical approximation is derived, but caution in its use is advocated. Computational details are provided for both the partitioning and the testing. The procedure is applied to several microarray data sets, showing that it can often achieve good results both quickly and simply.
机译:提出了一种简单且计算快速的过程,用于在聚类分析之前筛选大量变量。依次考虑每个变量,将样本分为两组,以使该变量的组间与组内平方和之比最大化,并测试该比率的实现值是否更大比从单个同质总体中划分样本时所期望的要好。然后,将达到显着性的那些变量用于聚类分析。建议使用蒙特卡洛计算程序评估显着性水平;通过假设组内正态性,可以得出分析近似值,但提倡谨慎使用。提供了分区和测试的计算详细信息。该程序已应用于多个微阵列数据集,表明它通常可以快速,简单地获得良好的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号