首页> 外文会议>IEEE International Conference on Data Mining >Speedup Clustering with Hierarchical Ranking
【24h】

Speedup Clustering with Hierarchical Ranking

机译:使用分层排名加快群集

获取原文

摘要

Many clustering algorithms in particular hierarchical clustering algorithms do not scale-up well for large data-sets especially when using an expensive distance function. In this paper, we propose a novel approach to perform approximate clustering with high accuracy. We introduce the concept of a pairwise hierarchical ranking to efficiently determine close neighbors for every data object. Empirical results on synthetic and real-life data show a speedup of up to two orders of magnitude over OPTICS while maintaining a high accuracy and up to one order of magnitude over the previously proposed DATA BUBBLES method, which also tries to speedup OPTICS by trading accuracy for speed.
机译:特定分层聚类算法中的许多聚类算法在大数据集中不展示很好,特别是在使用昂贵的距离功能时。在本文中,我们提出了一种以高精度执行近似聚类的新方法。我们介绍了成对分层排名的概念,以有效地确定每个数据对象的封闭邻居。合成和现实生活数据的经验结果显示出高达两个音量的大量级别,同时通过先前提出的数据泡沫方法保持高精度和高达一个数量级,这也试图通过交易准确度进行加速光学器件速度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号