首页> 外文期刊>IEICE Transactions on Information and Systems >SASUM: A Sharing-Based Approach to Fast Approximate Subgraph Matching for Large Graphs
【24h】

SASUM: A Sharing-Based Approach to Fast Approximate Subgraph Matching for Large Graphs

机译:SASUM:基于共享的大型图快速近似子图匹配方法

获取原文
获取原文并翻译 | 示例
           

摘要

Subgraph matching is a fundamental operation for querying graph-structured data. Due to potential errors and noises in real-world graph data, exact subgraph matching is sometimes inappropriate in practice. In this paper we consider an approximate subgraph matching model that allows missing edges. Based on this model, approximate subgraph matching finds all occurrences of a given query graph in a database graph, allowing missing edges. A straightforward approach is to first generate query subgraphs of a given query graph by deleting edges and then perform exact subgraph matching for each query subgraph. In this paper we propose a sharing-based approach to approximate subgraph matching, called SASUM. Our method is based on the fact that query subgraphs are highly overlapped. Due to this overlapping nature of query subgraphs, the matches of a query subgraph can be computed from the matches of a smaller query subgraph, which results in reducing the number of query subgraphs that require expensive exact subgraph matching. Our method uses a lattice framework to identify sharing opportunities between query subgraphs. To further reduce the number of graphs that need exact subgraph matching, SASUM generates small base graphs that are shared by query subgraphs and chooses the minimum number of base graphs whose matches are used to derive the matching results of all query subgraphs. A comprehensive set of experiments shows that our approach outperforms the state-of-the-art approach by orders of magnitude in terms of query execution time.
机译:子图匹配是查询图结构数据的基本操作。由于实际图形数据中可能存在错误和噪声,因此在实际中有时不适合进行精确的子图匹配。在本文中,我们考虑了允许丢失边的近似子图匹配模型。基于此模型,近似子图匹配可找到数据库图中所有出现的给定查询图,从而允许丢失边。一种直接的方法是首先通过删除边来生成给定查询图的查询子图,然后对每个查询子图执行精确的子图匹配。在本文中,我们提出了一种基于共享的近似子图匹配的方法,称为SASUM。我们的方法基于以下事实:查询子图高度重叠。由于查询子图的这种重叠性质,可以根据较小查询子图的匹配来计算查询子图的匹配,这导致减少了需要昂贵的精确子图匹配的查询子图的数量。我们的方法使用晶格框架来识别查询子图之间的共享机会。为了进一步减少需要精确子图匹配的图的数量,SASUM生成查询子图共享的小基本图,并选择其匹配用于导出所有查询子图的匹配结果的最小基本图数。一组全面的实验表明,在查询执行时间方面,我们的方法比最先进的方法好几个数量级。

著录项

  • 来源
    《IEICE Transactions on Information and Systems》 |2013年第3期|624-633|共10页
  • 作者单位

    The authors are with the Department of Computer Science,KAIST, 291 Daehak-ro, Yuseong-gu, Daejeon 305-701, Republic of Korea;

    The authors are with the Department of Computer Science,KAIST, 291 Daehak-ro, Yuseong-gu, Daejeon 305-701, Republic of Korea;

    The authors are with the Department of Computer Science,KAIST, 291 Daehak-ro, Yuseong-gu, Daejeon 305-701, Republic of Korea;

    The authors are with the Department of Computer Science,KAIST, 291 Daehak-ro, Yuseong-gu, Daejeon 305-701, Republic of Korea;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    graph database; approximate subgraph matching;

    机译:图形数据库近似子图匹配;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号