Density Based Initialization Method for K-Means Clustering Algorithm

Ajay Kumar; Shishir Kumar

首页> 外文期刊>International Journal of Intelligent Systems and Applications >Density Based Initialization Method for K-Means Clustering Algorithm

【24h】

Density Based Initialization Method for K-Means Clustering Algorithm

机译：K均值聚类算法的基于密度的初始化方法

获取原文

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Data clustering is a basic technique to show the structure of a data set. K-means clustering is a widely acceptable method of data clustering, which follow a partitioned approach for dividing the given data set into non-overlapping groups. Unfortunately, it has the pitfall of randomly choosing the initial cluster centers. Due to its gradient nature, this algorithm is highly sensitive to the initial seed value. In this paper, we propose a kernel density-based method to compute an initial seed value for the k-means algorithm. The idea is to select an initial point from the denser region because they truly reflect the property of the overall data set. Subsequently, we are avoiding the selection of outliers as an initial seed value. We have verified the proposed method on real data sets with the help of different internal and external validity measures. The experimental analysis illustrates that the proposed method has better performance over the k-means, k-means++ algorithm, and other recent initialization methods.

机译：数据聚类是显示数据集结构的基本技术。 K均值聚类是一种广泛接受的数据聚类方法，它遵循一种分区方法，用于将给定数据集划分为非重叠组。不幸的是，它具有随机选择初始聚类中心的陷阱。由于其梯度性质，该算法对初始种子值高度敏感。在本文中，我们提出了一种基于核密度的方法来计算k均值算法的初始种子值。这样做的想法是从较密集的区域中选择一个初始点，因为它们确实反映了整个数据集的属性。随后，我们避免选择离群值作为初始种子值。我们已经借助不同的内部和外部有效性度量对真实数据集验证了该方法。实验分析表明，该方法具有优于k-means，k-means ++算法和其他近期初始化方法的性能。

著录项

来源
《International Journal of Intelligent Systems and Applications》 |2017年第10期|共9页
作者
Ajay Kumar; Shishir Kumar;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类人工智能理论;
关键词

相似文献

外文文献
中文文献
专利

1. Two density-based k-means initialization algorithms for non-metric data clustering [J] . Bianchi Filippo Maria, Livi Lorenzo, Rizzi Antonello Pattern Analysis and Applications . 2016,第3期

机译：非度量数据聚类的两种基于密度的k均值初始化算法
2. Does Determination of Initial Cluster Centroids Improve the Performance of K-Means Clustering Algorithm? Comparison of Three Hybrid Methods by Genetic Algorithm, Minimum Spanning Tree, and Hierarchical Clustering in an Applied Study [J] . Saeedeh Pourahmad, Atefeh Basirat, Amir Rahimi, Computational and mathematical methods in medicine . 2020,第1期

机译：初始簇质心的确定是否提高了K-Means聚类算法的性能？应用研究中遗传算法，最小生成树和分层聚类的三种混合方法的比较
3. DIMK-means “Distance-based Initialization Method for K-means Clustering Algorithm” [J] . Raed T. Aldahdooh, Wesam Ashour International Journal of Intelligent Systems and Applications . 2013,第2期

机译：DIMK-均值“ K-均值聚类算法的基于距离的初始化方法”
4. A Density-Based Method for Selection of the Initial Clustering Centers of K-means Algorithm [C] . Xin Du, Ning Xu, Cailan Zhou, IEEE Advanced Information Technology, Electronic and Automation Control Conference . 2017

机译：基于密度的选择方法，用于选择K-Means算法的初始聚类中心
5. A K-means based watershed imaging segmentation algorithm for banana cluster quality inspection. [D] . Castillo Cepin, Gregorio Alfonso. 2016

机译：基于K均值的分水岭成像分割算法用于香蕉簇质量检测。
6. Does Determination of Initial Cluster Centroids Improve the Performance of K-Means Clustering Algorithm? Comparison of Three Hybrid Methods by Genetic Algorithm Minimum Spanning Tree and Hierarchical Clustering in an Applied Study [O] . Saeedeh Pourahmad, Atefeh Basirat, Amir Rahimi, 2020

机译：初始簇质心的确定是否提高了K-Means聚类算法的性能？应用研究中遗传算法最小生成树和分层聚类的三种混合方法的比较
7. Does Determination of Initial Cluster Centroids Improve the Performance of K-Means Clustering Algorithm? Comparison of Three Hybrid Methods by Genetic Algorithm, Minimum Spanning Tree, and Hierarchical Clustering in an Applied Study [O] . Saeedeh Pourahmad, Atefeh Basirat, Amir Rahimi, 2020

机译：初始簇质心的确定是否提高了K-Means聚类算法的性能？应用研究中遗传算法，最小生成树和分层聚类的三种混合方法的比较

Density Based Initialization Method for K-Means Clustering Algorithm

摘要

著录项

相似文献

相关主题

期刊订阅