...
首页> 外文期刊>Knowledge and information systems >TKAP: Efficiently processing top-k query on massive data by adaptive pruning
【24h】

TKAP: Efficiently processing top-k query on massive data by adaptive pruning

机译:TKAP:通过自适应修剪有效处理海量数据的top-k查询

获取原文
获取原文并翻译 | 示例
           

摘要

In many applications, top-k query is an important operation to return a set of interesting points in a potentially huge data space. The existing algorithms, either maintaining too many candidates, or requiring assistant structures built on the specific attribute subset, or returning results with probabilistic guarantee, cannot process top-k query on massive data efficiently. This paper proposes a sorted-list-based TKAP algorithm, which utilizes some data structures of low space overhead, to efficiently compute top-k results on massive data. In round-robin retrieval on sorted lists, TKAP performs adaptive pruning operation and maintains the required candidates until the stop condition is satisfied. The adaptive pruning operation can be adjusted by the information obtained in round-robin retrieval to achieve a better pruning effect. The adaptive pruning rule is developed in this paper, along with its theoretical analysis. The extensive experimental results, conducted on synthetic and real-life data sets, show the significant advantage of TKAP over the existing algorithms.
机译:在许多应用程序中,top-k查询是一项重要操作,可以在潜在的巨大数据空间中返回一组有趣的点。现有的算法,要么保持太多的候选者,要么需要在特定属性子集上建立辅助结构,或者返回具有概率保证的结果,就无法有效地处理海量数据的top-k查询。本文提出了一种基于排序列表的TKAP算法,该算法利用低空间开销的一些数据结构来有效地计算海量数据的top-k结果。在排序列表的循环检索中,TKAP执行自适应修剪操作并保留所需的候选对象,直到满足停止条件为止。可以通过循环检索中获得的信息来调整自适应修剪操作,以实现更好的修剪效果。本文提出了一种自适应修剪规则,并对其进行了理论分析。在合成和真实数据集上进行的大量实验结果表明,TKAP相对于现有算法具有显着优势。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号