...
首页> 外文期刊>TEST >On H?lder fields clustering
【24h】

On H?lder fields clustering

机译:在H?lder字段上聚类

获取原文
获取原文并翻译 | 示例
           

摘要

Based on n randomly drawn vectors in a Hilbert space, we study the k-means clustering scheme. Here, clustering is performed by computing the Voronoi partition associated with centers that minimize an empirical criterion, called distorsion. The performance of the method is evaluated by comparing the theoretical distorsion of empirical optimal centers to the theoretical optimal distorsion. Our first result states that, provided that the underlying distribution satisfies an exponential moment condition, an upper bound for the above performance criterion is $O(1/sqrt{n})$ . Then, motivated by a broad range of applications, we focus on the case where the data are real-valued random fields. Assuming that they share a H?lder property in quadratic mean, we construct a numerically simple k-means algorithm based on a discretized version of the data. With a judicious choice of the discretization, we prove that the performance of this algorithm matches the performance of the classical algorithm.
机译:基于希尔伯特空间中n个随机绘制的向量,我们研究了k均值聚类方案。在此,通过计算与最小化经验标准(称为失真)的中心相关的Voronoi分区来执行聚类。通过将经验最优中心的理论失真与理论最优失真进行比较来评估该方法的性能。我们的第一个结果指出,假设基础分布满足指数矩条件,则上述性能标准的上限为$ O(1 / sqrt {n})$。然后,受广泛应用的启发,我们将重点放在数据是实值随机字段的情况下。假设它们以二次均值共享H?lder属性,我们基于数据的离散化版本构造一个数值上简单的k均值算法。通过对离散化的明智选择,我们证明了该算法的性能与经典算法的性能相匹配。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号