A study on Two-Stage Mixed Attribute Data Clustering Based on Density Peaks

Liu Shihua; Zhang Hao; Liu Xianghua

首页> 外文期刊>The international arab journal of information technology >A study on Two-Stage Mixed Attribute Data Clustering Based on Density Peaks

【24h】

A study on Two-Stage Mixed Attribute Data Clustering Based on Density Peaks

机译：基于密度峰值的两阶段混合属性数据聚类研究

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

A Two-stage clustering framework and a clustering algorithm for mixed attribute data based on density peaks and Goodall distance are proposed. Firstly, the subset of numerical attributes of the dataset is clustered, and then the result is mapped into one-dimensional categorical attribute and added to the subset of categorical attribute data. Finally, the new dataset is clustered by the density peaks clustering algorithm to obtain the final result. Experiments on three commonly used UCI datasets show that this algorithm can effectively realize mixed attribute clustering and produce better clustering results than the traditional K-prototypes algorithm do. The clustering accuracy on the Acute, Heart and Credit datasets are 17%, 24%, and 21% higher on average than that of the K-prototypes, respectively.

机译：提出了一种基于密度峰值和GoodAll距离的两个阶段聚类框架和用于混合属性数据的聚类算法。首先，将数据集的数值子集群集，然后将结果映射到一维分类属性中，并添加到分类属性数据的子集。最后，通过密度峰值聚类算法群集新数据集以获得最终结果。三个常用的UCI数据集上的实验表明，该算法可以有效地实现混合属性聚类，并产生比传统的k原型算法更好的聚类结果。急性，心脏和信用数据集的聚类精度分别比K-原型的平均值高出17％，24％和21％。

著录项

来源
《The international arab journal of information technology》 |2021年第5期|634-643|共10页
作者
Liu Shihua; Zhang Hao; Liu Xianghua;
展开▼
作者单位

Wenzhou Polytech Dept Informat Technol Wenzhou Peoples R China;

Wenzhou Polytech Dept Informat Technol Wenzhou Peoples R China;

Wenzhou Polytech Dept Informat Technol Wenzhou Peoples R China;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Mixed data clustering; density peaks; k-prototypes algorithm; validity index;

机译：混合数据聚类;密度峰;K-原型算法;有效性指数;

相似文献

外文文献
中文文献
专利

1. An entropy-based density peaks clustering algorithm for mixed type data employing fuzzy neighborhood [J] . Ding Shifei, Du Mingjing, Sun Tongfeng, Knowledge-Based Systems . 2017,第octa1期

机译：基于模糊邻域的混合数据基于熵的密度峰聚类算法
2. A method of two-stage clustering learning based on improved DBSCAN and density peak algorithm [J] . Li Mingyang, Bi Xinhua, Wang Limin, Computer Communications . 2021,第Feba期

机译：一种基于改进DBSCAN和密度峰算法的两级聚类学习方法
3. Clustering algorithm for mixed datasets using density peaks and Self-Organizing Generative Adversarial Networks [J] . Balaji K., Lavanya K., Mary A. Geetha Chemometrics and Intelligent Laboratory Systems . 2020,第1期

机译：利用密度峰和自组织生成对抗网络的混合数据集聚类算法
4. MMDBC: Density-Based Clustering Algorithm for Mixed Attributes and Multi-dimension Data [C] . Haizhou Du, Wei Fang, Haining Huang, IEEE International Conference on Big Data and Smart Computing . 2018

机译：MMDBC：混合属性和多维数据的基于密度的聚类算法
5. Image reconstruction of muon tomographic data using a density-based clustering method. [D] . Perry, Kimberly B. 2015

机译：使用基于密度的聚类方法对μ子层析成像数据进行图像重建。
6. A Robust Multi-Sensor Data Fusion Clustering Algorithm Based on Density Peaks [O] . Jiande Fan, Weixin Xie, Haocui Du 2020

机译：基于密度峰值的鲁棒多传感器数据融合聚类算法
7. Clustering Mixed Data Based on Density Peaks and Stacked Denoising Autoencoders [O] . Baobin Duan, Lixin Han, Zhinan Gou, 2019

机译：基于密度峰值和堆叠的去噪自动化器聚类混合数据

A study on Two-Stage Mixed Attribute Data Clustering Based on Density Peaks

摘要

著录项

相似文献

相关主题

期刊订阅