首页> 外文会议>International Conference for High Performance Computing, Networking, Storage and Analysis >ScaleMine: Scalable Parallel Frequent Subgraph Mining in a Single Large Graph
【24h】

ScaleMine: Scalable Parallel Frequent Subgraph Mining in a Single Large Graph

机译:ScaleMine:单个大图中的可伸缩并行频繁子图挖掘

获取原文

摘要

Frequent Subgraph Mining is an essential operation for graph analytics and knowledge extraction. Due to its high computational cost, parallel solutions are necessary. Existing approaches either suffer from load imbalance, or high communication and synchronization overheads. In this paper we propose ScaleMine; a novel parallel frequent subgraph mining system for a single large graph. ScaleMine introduces a novel two-phase approach. The first phase is approximate; it quickly identifies subgraphs that are frequent with high probability, while collecting various statistics. The second phase computes the exact solution by employing the results of the approximation to achieve good load balance; prune the search space; generate efficient execution plans; and guide intra-task parallelism. Our experiments show that ScaleMine scales to 8,192 cores on a Cray XC40 (12× more than competitors); supports graphs with one billion edges (10× larger than competitors), and is at least an order of magnitude faster than existing solutions.
机译:频繁的子图挖掘是图分析和知识提取的基本操作。由于计算成本高,因此需要并行解决方案。现有方法要么承受负载不平衡,要么承受高通信和同步开销。在本文中,我们提出了ScaleMine。一种用于单个大图的新颖的并行频繁子图挖掘系统。 ScaleMine引入了一种新颖的两阶段方法。第一阶段是近似的;第二阶段是近似的。在收集各种统计数据的同时,它可以快速地识别出频繁出现的子图。第二阶段通过采用近似结果来实现良好的负载平衡,从而计算出精确的解决方案;修剪搜索空间;制定有效的执行计划;并指导任务内并行性。我们的实验表明,ScaleMine可以在Cray XC40上扩展到8,192个内核(比竞争对手多12倍)。支持具有十亿条边(比竞争对手大10倍)的图,并且比现有解决方案至少快一个数量级。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号