...
首页> 外文期刊>The international arab journal of information technology >A study on Two-Stage Mixed Attribute Data Clustering Based on Density Peaks
【24h】

A study on Two-Stage Mixed Attribute Data Clustering Based on Density Peaks

机译:基于密度峰值的两阶段混合属性数据聚类研究

获取原文
获取原文并翻译 | 示例
           

摘要

A Two-stage clustering framework and a clustering algorithm for mixed attribute data based on density peaks and Goodall distance are proposed. Firstly, the subset of numerical attributes of the dataset is clustered, and then the result is mapped into one-dimensional categorical attribute and added to the subset of categorical attribute data. Finally, the new dataset is clustered by the density peaks clustering algorithm to obtain the final result. Experiments on three commonly used UCI datasets show that this algorithm can effectively realize mixed attribute clustering and produce better clustering results than the traditional K-prototypes algorithm do. The clustering accuracy on the Acute, Heart and Credit datasets are 17%, 24%, and 21% higher on average than that of the K-prototypes, respectively.
机译:提出了一种基于密度峰值和GoodAll距离的两个阶段聚类框架和用于混合属性数据的聚类算法。 首先,将数据集的数值子集群集,然后将结果映射到一维分类属性中,并添加到分类属性数据的子集。 最后,通过密度峰值聚类算法群集新数据集以获得最终结果。 三个常用的UCI数据集上的实验表明,该算法可以有效地实现混合属性聚类,并产生比传统的k原型算法更好的聚类结果。 急性,心脏和信用数据集的聚类精度分别比K-原型的平均值高出17%,24%和21%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号