Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Jul 23-26, 2002, Edmonton

SyMP: An Efficient Clustering Approach to Identify Clusters of Arbitrary Shapes in Large Data Sets

Abstract

We propose a new clustering algorithm, called SyMP, which is based on synchronization of pulse-coupled oscillators. SyMP represents each data point by an Integrate-and-Fire oscillator and uses the relative similarity between the points to model the interaction between the oscillators. SyMP is robust to noise and outliers, determines the number of clusters in an unsupervised manner, identifies clusters of arbitrary shapes, and can handle very large data sets. The robustness of SyMP is an intrinsic property of the synchronization mechanism. To determine the optimum number of clusters, SyMP uses a dynamic resolution parameter. To identify clusters of various shapes, SyMP models each cluster by multiple Gaussian components. The number of components is automatically determined using a dynamic intra-cluster resolution parameter. Clusters with simple shapes would be modeled by few components while clusters with more complex shapes would require a larger number of components. The scalable version of SyMP uses an efficient incremental approach that requires a simple pass through the data set. The proposed clustering approach is empirically evaluated with several synthetic and real data sets, and its performance is compared with CURE.
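
The synchronization idea in the abstract can be illustrated with a small toy sketch. It is not the SyMP algorithm itself: the function name `pulse_coupled_clusters`, the fixed `radius` that decides which points count as similar, the constant `coupling` strength, and the concave phase-to-state map with parameter `b` are all assumptions made for illustration. The sketch leaves out the dynamic resolution parameters, the multiple Gaussian components per cluster, and the incremental version for large data sets. Each point gets one integrate-and-fire oscillator; when an oscillator fires, it pushes similar points toward the threshold and dissimilar points away from it, and oscillators that fire together are merged into one group.

```python
import math
import random


def pulse_coupled_clusters(points, radius, coupling=0.2, b=3.0,
                           n_firings=500, seed=0):
    """Toy clustering by synchronization of integrate-and-fire oscillators.

    Illustrative sketch only: the fixed `radius`, the constant `coupling`,
    and the state map below are assumptions, not SyMP's actual rules.
    """
    rng = random.Random(seed)
    n = len(points)
    phase = [rng.random() for _ in range(n)]  # one oscillator per data point
    labels = list(range(n))                   # every point starts alone
    scale = math.expm1(b)

    def state(phi):
        # Concave phase-to-state map, as in classic pulse-coupled models.
        return math.log1p(scale * phi) / b

    def phase_of(x):
        # Inverse of state().
        return math.expm1(b * x) / scale

    for _ in range(n_firings):
        # The oscillator with the largest phase reaches the threshold first.
        leader = max(range(n), key=phase.__getitem__)
        step = 1.0 - phase[leader]
        fired = [leader]
        for j in range(n):
            if j == leader:
                continue
            x = state(phase[j] + step)
            if math.dist(points[leader], points[j]) <= radius:
                x += coupling          # similar point: excitatory pulse
                if x >= 1.0:           # pushed over the threshold, fires too
                    fired.append(j)
                    continue
            else:
                x -= coupling          # dissimilar point: inhibitory pulse
            phase[j] = phase_of(min(max(x, 0.0), 1.0))
        # Oscillators that fire together are merged into one group and reset.
        target = labels[leader]
        fired_labels = {labels[j] for j in fired}
        for j in range(n):
            if labels[j] in fired_labels:
                labels[j] = target
        for j in fired:
            phase[j] = 0.0
    return labels
```

Calling `pulse_coupled_clusters(points, radius=1.0)` on a list of coordinate tuples returns one integer label per point. In this sketch only points within `radius` of a firing oscillator can join its group, so groups never span a gap wider than `radius`. How quickly nearby points merge depends on `coupling`, `b`, and the random initial phases.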
