...
首页> 外文期刊>International Journal of Innovative Computing Information and Control >AN INITIALIZATION METHOD OF K-MEANS CLUSTERING ALGORITHM FOR MIXED DATA
【24h】

AN INITIALIZATION METHOD OF K-MEANS CLUSTERING ALGORITHM FOR MIXED DATA

机译:混合数据的K均值聚类算法的初始化方法

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

The k-means clustering algorithm is undoubtedly the most widely used partitional algorithms. Unfortunately, due to its gradient descent nature, this algorithm is highly sensitive to the initialization of clustering. Initialization methods have been proposed to address this problem. In this paper, we present an overview of initialization methods of clustering for numerical data and categorical data respectively with an emphasis on their computational efficiency. We then propose a new initialization method for mixed data, which can obtain the good initial cluster centers using the MaxAvg distance, and give the effective k-means clustering for mixed data. Finally, the proposed method is verified on three different real world datasets from UCI Machine Learning Repository, and it is shown that the proposed method is effective and efficient for initializing and partitioning mixed data.
机译:k均值聚类算法无疑是使用最广泛的分区算法。不幸的是,由于其梯度下降特性,该算法对聚类的初始化非常敏感。已经提出了初始化方法来解决这个问题。在本文中,我们概述了分别对数值数据和分类数据进行聚类的初始化方法,重点是它们的计算效率。然后,我们提出了一种新的混合数据初始化方法,该方法可以使用MaxAvg距离获得良好的初始聚类中心,并为混合数据提供有效的k均值聚类。最后,在UCI机器学习存储库中的三个不同的现实世界数据集上验证了该方法,并表明该方法对于初始化和划分混合数据是有效的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号