k-Means clustering with a new divergence-based distance metric: Convergence and performance analysis

Chakraborty Saptarshi; Das Swagatam

首页> 外文期刊>Pattern recognition letters >k-Means clustering with a new divergence-based distance metric: Convergence and performance analysis

【24h】

k-Means clustering with a new divergence-based distance metric: Convergence and performance analysis

机译：具有新的基于散度的距离度量的k-Means聚类：收敛和性能分析

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

The choice of a proper similarity/dissimilarity measure is very important in cluster analysis for revealing the natural grouping in a given dataset. Choosing the most appropriate measure has been an open problem for many years in cluster analysis. Among various approaches of incorporating a non-Euclidean dissimilarity measure for clustering, use of the divergence-based distance functions has recently gained attention in the perspective of partitional clustering. Following this direction, we propose a new point-to-point distance measure called the S-distance motivated from the recently developed S-divergence measure (originally defined on the open cone of positive definite matrices) and discuss some of its important properties. We subsequently develop the S - k-means algorithm (with Lloyd's heuristic) which replaces the conventional Euclidean distance of k-means with the S-distance. We also provide a theoretical analysis of the S - k-means algorithm establishing the convergence of the obtained partial optimal solutions to a locally optimal solution. The performance of S - k-means is compared with the classical k-means algorithm with Euclidean distance metric and its feature-weighted variants using several synthetic and real-life datasets. The comparative study indicates that our results are appealing, especially when the distribution of the clusters is not regular. (C) 2017 Elsevier B.V. All rights reserved.

机译：在聚类分析中，为了揭示给定数据集中的自然分组，选择适当的相似性/差异性度量非常重要。在聚类分析中，选择最合适的方法多年来一直是一个未解决的问题。在将非欧几里得差异度量用于聚类的各种方法中，基于分区的距离函数的使用最近在分区聚类的角度得到了关注。按照这个方向，我们提出了一种新的点对点距离度量，该度量是根据最近开发的S-散度度量（最初在正定矩阵的开放圆锥上定义）得出的，并讨论了其中的一些重要特性。随后，我们开发了S-k-means算法（采用劳埃德启发式算法），该算法用S距离替换了传统的k-means欧几里得距离。我们还提供了S-k-均值算法的理论分析，建立了获得的局部最优解与局部最优解的收敛性。使用几个合成的和真实的数据集，将S-k-means的性能与具有欧几里得距离度量标准的经典k-means算法及其特征加权变量进行了比较。对比研究表明，我们的结果具有吸引力，尤其是当群集的分布不规则时。（C）2017 Elsevier B.V.保留所有权利。

著录项

来源
《Pattern recognition letters》 |2017年第1期|67-73|共7页
作者
Chakraborty Saptarshi; Das Swagatam;
展开▼
作者单位

Indian Stat Inst, Stat & Math Unit, Kolkata 700108, India;

Indian Stat Inst, Elect & Commun Sci Unit, Kolkata 700108, India;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
k-means clustering; S-distance; S-divergence; Lloyd's heuristics; Convergence;

机译：k-均值聚类;S-距离;S-散度;劳埃德启发法;收敛;

相似文献

外文文献
中文文献
专利

1. Improving performance of classification on severity of ill effects (SEV) index on fish using K-Means clustering algorithm with various distance metrics [J] . Khakzad Hamid Water Practice and Technology . 2019,第1期

机译：使用具有各种距离指标的K-Means聚类算法提高对鱼类的病害严重性（SEV）指数的分类性能
2. Improving performance of classification on severity of ill effects (SEV) index on fish using K-Means clustering algorithm with various distance metrics [J] . Khakzad Hamid Water Practice and Technology . 2018,第4期

机译：使用具有各种距离指标的K-Means聚类算法提高对鱼类的病害严重性（SEV）指数的分类性能
3. Distance Based Hybrid Approach for Cluster Analysis Using Variants of K-means and Evolutionary Algorithm [J] . O.A. Mohamed Jafar, R. Sivakumar Research journal of applied science, engineering and technology . 2014,第11期

机译：基于距离的K均值和进化算法的聚类分析混合方法
4. Performance Analysis of Uncertain K-means Clustering Algorithm Using Different Distance Metrics [C] . Swati Aggarwal, Nitika Agarwal, Monal Jain International Conference on Computational Intelligence . 2019

机译：不同距离度量的不确定K均值聚类算法的性能分析
5. Performance analysis of EM-MPM and K-means clustering in 3D ultrasound breast image segmentation [D] . Yang, Huanyi 2013

机译：EM-MPM和K-means聚类在3D超声乳腺图像分割中的性能分析
6. Does Determination of Initial Cluster Centroids Improve the Performance of K-Means Clustering Algorithm? Comparison of Three Hybrid Methods by Genetic Algorithm Minimum Spanning Tree and Hierarchical Clustering in an Applied Study [O] . Saeedeh Pourahmad, Atefeh Basirat, Amir Rahimi, 2020

机译：初始簇质心的确定是否提高了K-Means聚类算法的性能？应用研究中遗传算法最小生成树和分层聚类的三种混合方法的比较
7. IMPACT OF DISTANCE METRICS ON THE PERFORMANCE OF K-MEANS AND FUZZY C-MEANS CLUSTERING – AN APPROACH TO ASSESS STUDENT’S PERFORMANCE IN E-LEARNING ENVIRONMENT [O] . V.P. Mahatme 2018

机译：距离指标对k型和模糊C型聚类性能的影响 - 一种评估学生电子学习环境性能的方法

k-Means clustering with a new divergence-based distance metric: Convergence and performance analysis

摘要

著录项

相似文献

相关主题

期刊订阅