...
首页> 外文期刊>Data in Brief >Performance evaluation results of evolutionary clustering algorithm star for clustering heterogeneous datasets
【24h】

Performance evaluation results of evolutionary clustering algorithm star for clustering heterogeneous datasets

机译:集群异构数据集进化聚类算法星的绩效评估结果

获取原文
           

摘要

This article presents the data used to evaluate the performance of evolutionary clustering algorithm star (ECA*) compared to five traditional and modern clustering algorithms. Two experimental methods are employed to examine the performance of ECA* against genetic algorithm for clustering++ (GENCLUST++), learning vector quantisation (LVQ), expectation maximisation (EM), K-means++ (KM++) and K-means (KM). These algorithms are applied to 32 heterogenous and multi-featured datasets to determine which one performs well on the three tests. For one, ther paper examines the efficiency of ECA* in contradiction of its corresponding algorithms using clustering evaluation measures. These validation criteria are objective function and cluster quality measures. For another, it suggests a performance rating framework to measurethe the performance sensitivity of these algorithms on varos dataset features (cluster dimensionality, number of clusters, cluster overlap, cluster shape and cluster structure). The contributions of these experiments are two-folds: (i) ECA* exceeds its counterpart aloriths in ability to find out the right cluster number; (ii) ECA* is less sensitive towards dataset features compared to its competitive techniques. Nonetheless, the results of the experiments performed demonstrate some limitations in the ECA*: (i) ECA* is not fully applied based on the premise that no prior knowledge exists; (ii) Adapting and utilising ECA* on several real applications has not been achieved yet.
机译:本文介绍了用于评估进化聚类算法明星(ECA *)的性能的数据与五种传统和现代聚类算法相比。采用两种实验方法检查ECA *对聚类++(Genclust ++)的遗传算法的性能,学习矢量定量(LVQ),期望最大化(EM),K-Means ++(KM ++)和K-Means(Km)。这些算法应用于32个异构和多颗粒数据集,以确定哪一个在三个测试中执行良好。对于一种,纸张通过聚类评估措施审查了ECA *的效率矛盾。这些验证标准是客观函数和集群质量措施。对于另一个,它表明了一种绩效评级框架,用于测量对Varos数据集特征的这些算法的性能敏感性(集群维度,集群数,群集重叠,群集形状和群集结构)。这些实验的贡献是两倍:(i)ECA *超出其在能够找出正确的聚类号码的同行阶层; (ii)与其竞争技术相比,ECA *对数据集特征不太敏感。尽管如此,所执行的实验结果表明了ECA *的一些限制:(i)ECA *未得到完全申请的基础,没有现有知识存在; (ii)在几个实际应用中调整和利用ECA *尚未实现。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号