【24h】

Comparing Partitions by Subset Similarities

机译:按子集相似性比较分区

获取原文
获取原文并翻译 | 示例

摘要

Comparing partitions is an important issue in classification and clustering when comparing results from different methods, parameters, or initializations. A well-established method for comparing partitions is the Rand index but this index is suitable for crisp partitions only. Recently, the Hiillermeier-Rifqi index was introduced which is a generalization of the Rand index to fuzzy partitions. In this paper we introduce a new approach to comparing partitions based on the similarities of their clusters in the sense of set similarity. All three indices, Rand, Hiillermeier-Rifqi, and subset similarity, are reflexive, invariant against row permutations, and invariant against additional empty subsets. The subset similarity index is not a generalization of the Rand index, but produces similar values. Subset similarity yields more intuitive similarities than Hiillermeier-Rifqi when comparing crisp and fuzzy partitions, and yields smoother nonlinear transitions. Finally, the subset similarity index has a lower computational complexity than the Hiillermeier-Rifqi index for large numbers of objects.
机译:比较不同方法,参数或初始化的结果时,比较分区是分类和聚类中的重要问题。一种比较好的分区比较方法是Rand索引,但是该索引仅适用于明快分区。最近,引入了Hiillermeier-Rifqi指数,这是将Rand指数推广到模糊分区的方法。在本文中,我们介绍了一种在集合相似性的意义上基于分区的相似性比较分区的新方法。 Rand,Hiillermeier-Rifqi和子集相似性这三个索引都是自反的,对行排列不变,对其他空子集不变。子集相似性索引不是Rand索引的一般化,而是产生相似的值。在比较明快和模糊分区时,子集相似度比Hiillermeier-Rifqi产生更直观的相似度,并且产生更平滑的非线性过渡。最后,对于大量对象,子集相似性索引的计算复杂度低于Hiillermeier-Rifqi索引。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号