首页> 外文会议>Proceedings of the 5th International Conference on Computer Science and Education >Date clustering using Principal Component Analysis and Particle Swarm Optimization
【24h】

Date clustering using Principal Component Analysis and Particle Swarm Optimization

机译:使用主成分分析和粒子群算法的数据聚类

获取原文

摘要

By analyzing the actuality of the data mining in ecommerce environment, and considering the complexity and the curse of dimensionality about extracting the implicit and unknown knowledge brought by the massive and high-dimensional data. Based on the K-means clustering, Particle Swarm Optimization (PSO) clustering and hybrid PSO clustering algorithm, this paper presented a model which combined Principal Component Analysis (PCA) with hybrid PSO to cluster data. The interrelated data have been processed by Principal Component Analysis; the results of PCA are input data for hybrid PSO clustering algorithms. It not only reduced the dimension of input variables and the scales of clustering, but also reserved the main information of original variables and eliminated the multicollinearity between the variables. It offered an effective model method for data clustering which has characters like massive, high-dimensional and heterogeneous.
机译:通过分析电子商务环境中数据挖掘的现状,并考虑了提取海量和高维数据带来的隐性和未知知识的复杂性和维数的诅咒。基于K均值聚类,粒子群优化(PSO)聚类和混合PSO聚类算法,提出了一种将主成分分析(PCA)与混合PSO相结合的聚类模型。相互关联的数据已通过主成分分析进行了处理; PCA的结果是混合PSO聚类算法的输入数据。它不仅减小了输入变量的维数和聚类的规模,而且保留了原始变量的主要信息,并消除了变量之间的多重共线性。它为数据聚类提供了一种有效的模型方法,该方法具有大规模,高维和异构等特点。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号