...
首页> 外文期刊>Knowledge and Data Engineering, IEEE Transactions on >An Iterative MapReduce Based Frequent Subgraph Mining Algorithm
【24h】

An Iterative MapReduce Based Frequent Subgraph Mining Algorithm

机译:基于迭代MapReduce的频繁子图挖掘算法

获取原文
获取原文并翻译 | 示例
           

摘要

Frequent subgraph mining (FSM) is an important task for exploratory data analysis on graph data. Over the years, many algorithms have been proposed to solve this task. These algorithms assume that the data structure of the mining task is small enough to fit in the main memory of a computer. However, as the real-world graph data grows, both in size and quantity, such an assumption does not hold any longer. To overcome this, some graph database-centric methods have been proposed in recent years for solving FSM; however, a distributed solution using MapReduce paradigm has not been explored extensively. Since MapReduce is becoming the de-facto paradigm for computation on massive data, an efficient FSM algorithm on this paradigm is of huge demand. In this work, we propose a frequent subgraph mining algorithm called which uses an iterative MapReduce based framework. is complete as it returns all the frequent subgraphs for a given user-defined support, and it is efficient as it applies all the optimizations that the latest FSM algorithms adopt. Our experiments with real life and large synthetic datasets validate the effectiveness of for mining frequent subgraphs from large graph datasets. The source code of is available from www.cs.iupui.edu/~alhasan/software/
机译:频繁的子图挖掘(FSM)是对图数据进行探索性数据分析的重要任务。多年来,已经提出了许多算法来解决该任务。这些算法假定挖掘任务的数据结构足够小以适合计算机的主内存。但是,随着现实世界中图形数据的增长(无论大小还是数量),这种假设不再成立。为了克服这个问题,近年来提出了一些以图形数据库为中心的方法来解决FSM。但是,尚未广泛探索使用MapReduce范例的分布式解决方案。由于MapReduce成为用于处理海量数据的实际范例,因此对这种范例的高效FSM算法有巨大的需求。在这项工作中,我们提出了一种称为子图的频繁挖掘算法,该算法使用基于MapReduce的迭代框架。因为它返回了给定的用户定义支持的所有频繁子图,所以它是完整的,并且由于应用了最新FSM算法采用的所有优化而非常有效。我们对现实生活和大型合成数据集的实验验证了从大型图形数据集中挖掘频繁子图的有效性。的源代码可从www.cs.iupui.edu/~alhasan/software/获得。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号