首页> 外文会议>International Computer Science and Engineering Conference >Farthest boundary clustering algorithm: Half-orbital extreme pole
【24h】

Farthest boundary clustering algorithm: Half-orbital extreme pole

机译:最远的边界聚类算法:半轨道极端杆

获取原文

摘要

Clustering analysis is a process of splitting a dataset into various groups of smaller datasets such that instances in a particular group are more similar to one another than instances from other groups. In this paper, we propose a novel boundary approach to perform a clustering analysis. Our algorithm starts from identifying two instances that have the largest distance within the dataset, called extreme poles. The two farthest pairs of instances can either be two far ends of the same cluster group or two far ends of two different groups. Then a vector core is generated using these two poles. Various pre-determined distances from one of these two poles will split data into various layers. If the extreme poles lie within one group, then the number of instances within the layers must be distributed appropriately. Otherwise, the dataset needs to be split. Our algorithm will recursively perform on these smaller datasets until the stopping criteria are met. To demonstrate the effectiveness of our method, we compare our algorithm with the K-means clustering algorithm using the value of K from our algorithm. The results show that the total variance from our algorithm is not larger than that from the K-means algorithm.
机译:聚类分析是将数据集分成各种较小数据集的过程,使得特定组中的实例比来自其他组的实例更相似。在本文中,我们提出了一种新的边界方法来执行聚类分析。我们的算法始于识别具有数据集中最大距离的两个实例,称为极端杆。两个最远的成对的实例可以是同一集群组的两个远端或两个不同组的两端。然后使用这两极生成矢量核心。距离这两极之一的各种预定距离将分为各个层。如果极端极端在一个组内,则必须适当地分布层内的实例数。否则,需要拆分数据集。我们的算法将在这些较小的数据集上递归地执行,直到满足停止标准。为了展示我们方法的有效性,我们将算法与来自我们算法的k的值进行比较K-Means聚类算法。结果表明,我们的算法的总方差不大于K-Means算法的差异。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号