【24h】

The best clustering algorithms in data mining

机译:数据挖掘中最好的聚类算法

获取原文

摘要

In data mining, Clustering is the most popular, powerful and commonly used unsupervised learning technique. It is a way of locating similar data objects into clusters based on some similarity. Clustering algorithms can be categorized into seven groups, namely Hierarchical clustering algorithm, Density-based clustering algorithm, Partitioning clustering algorithm, Graph-based algorithm, Grid-based algorithm, Model-based clustering algorithm and Combinational clustering algorithm. These clustering algorithms give different result according to the conditions. Some clustering techniques are better for large data set and some gives good result for finding cluster with arbitrary shapes. This paper is planned to learn and relates various data mining clustering algorithms. Algorithms which are under exploration as follows: K-Means algorithm, K-Medoids, Distributed K-Means clustering algorithm, Hierarchical clustering algorithm, Grid-based Algorithm and Density based clustering algorithm. This paper compared all these clustering algorithms according to the many factors. After comparison of these clustering algorithms I describe that which clustering algorithms should be used in different conditions for getting the best result.
机译:在数据挖掘中,聚类是最流行,功能强大且最常用的无监督学习技术。这是一种基于相似性将相似数据对象定位到群集中的方法。聚类算法可以分为七类,分别是分层聚类算法,基于密度的聚类算法,分区聚类算法,基于图的算法,基于网格的算法,基于模型的聚类算法和组合聚类算法。这些聚类算法根据条件给出不同的结果。对于大数据集,某些聚类技术更好,而对于任意形状的聚类,某些聚类技术可以提供良好的结果。本文计划学习和关联各种数据挖掘聚类算法。正在研究的算法如下:K-Means算法,K-Medoids,分布式K-Means聚类算法,分层聚类算法,基于网格的算法和基于密度的聚类算法。本文根据许多因素对所有这些聚类算法进行了比较。在比较了这些聚类算法之后,我描述了应在不同条件下使用哪种聚类算法以获得最佳结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号