首页> 外文会议>International colloquium on automata, languages and programming >Approximation Algorithms for Min-Sum k-Clustering and Balanced k-Median
【24h】

Approximation Algorithms for Min-Sum k-Clustering and Balanced k-Median

机译:最小和k聚类和平衡k中值的近似算法

获取原文

摘要

We consider two closely related fundamental clustering problems in this paper. In the Min-Sum k-Clustering problem, one is given a metric space and has to partition the points into k clusters while minimizing the total pairwise distances between the points assigned to the same cluster. In the Balanced k-Median problem, the instance is the same and one has to obtain a partitioning into k clusters C_1,..., C_k, where each cluster C_i has a center c_i, while minimizing the total assignment costs for the points in the metric; here the cost of assigning a point j to a cluster C_i is equal to |C_i| times the distance between j and c_i in the metric. In this paper, we present an O(logn)-approximation for both these problems where n is the number of points in the metric that are to be served. This is an improvement over the O(∈~(-1) log~(1+∈) n)-approximation (for any constant ∈ > 0) obtained by Bartal, Charikar, and Raz [STOC 01]. We also obtain a quasi-PTAS for Balanced k-Median in metrics with constant doubling dimension. As in the work of Bartal et al., our approximation for general metrics uses embeddings into tree metrics. The main technical contribution in this paper is an O(1)-approximation for Balanced k-Median in hierarchically separated trees (HSTs). Our improvement comes from a more direct dynamic programming approach that heavily exploits properties of standard HSTs. In this way, we avoid the reduction to special types of HSTs that were considered by Bartal et al., thereby avoiding an additional O(∈~(-1) log~∈ n) loss.
机译:在本文中,我们考虑了两个密切相关的基本聚类问题。在最小和k聚类问题中,给定一个度量空间,并且必须将这些点划分为k个聚类,同时最小化分配给同一聚类的点之间的总成对距离。在Balanced k-Median问题中,实例相同,必须将其划分为k个群集C_1,...,C_k,其中每个群集C_i具有一个中心c_i,同时最小化点中的总分配成本指标;这里,将点j分配给群集C_i的成本等于| C_i |乘以度量中j和c_i之间的距离。在本文中,我们针对这两个问题都给出了O(logn)逼近,其中n是度量中要服务的点数。这是对Bartal,Charikar和Raz [STOC 01]获得的O(∈〜(-1)log〜(1 +∈)n)逼近(任何常数∈> 0)的改进。我们还获得了具有恒定倍增维的度量的平衡k中值的准PTAS。正如Bartal等人的工作一样,我们对通用指标的近似使用了嵌入到树指标中的方法。本文的主要技术贡献是分层分离树(HST)中平衡k中值的O(1)逼近。我们的改进来自更直接的动态编程方法,该方法大量利用了标准HST的属性。这样,我们避免了Bartal等人考虑的减少特殊类型的HST,从而避免了额外的O(∈〜(-1)log〜∈n)损失。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号