首页> 外文期刊>International Journal of Advanced Statistics and Probability >Missing data treatment method on cluster analysis
【24h】

Missing data treatment method on cluster analysis

机译:聚类分析缺失数据处理方法

获取原文
           

摘要

The missing data in household health survey was challenged for the researcher because of incomplete analysis. The statistical tool cluster analysis methodology implemented in the collected data of Sudan's household health survey in 2006. Current research specifically focuses on the data analysis as the objective is to deal with the missing values in cluster analysis. Two-Step Cluster Analysis is applied in which each participant is classified into one of the identified pattern and the optimal number of classes is determined using SPSS Statistics/IBM. However, the risk of over-fitting of the data must be considered because cluster analysis is a multivariable statistical technique. Any observation with missing data is excluded in the Cluster Analysis because like multi-variable statistical techniques. Therefore, before performing the cluster analysis, missing values will be imputed using multiple imputations (SPSS Statistics/IBM). The clustering results will be displayed in tables. The descriptive statistics and cluster frequencies will be produced for the final cluster model, while the information criterion table will display results for a range of cluster solutions.
机译:由于分析不完整,家庭健康调查中缺少的数据对研究人员提出了挑战。统计工具聚类分析方法在2006年苏丹的家庭健康调查收集的数据中实施。当前的研究专门针对数据分析,因为其目的是处理聚类分析中的缺失值。应用两步聚类分析,其中将每个参与者分类为已识别的模式之一,并使用SPSS Statistics / IBM确定最佳类别数。但是,由于聚类分析是一种多变量统计技术,因此必须考虑数据过度拟合的风险。像多变量统计技术一样,在聚类分析中将排除任何缺少数据的观察。因此,在执行聚类分析之前,将使用多个插补(SPSS Statistics / IBM)插补缺失值。聚类结果将显示在表格中。将为最终的聚类模型生成描述性统计信息和聚类频率,而信息标准表将显示一系列聚类解决方案的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号