...
首页> 外文期刊>ACM transactions on knowledge discovery from data >Fast, Accurate and Provable Triangle Counting in Fully Dynamic Graph Streams
【24h】

Fast, Accurate and Provable Triangle Counting in Fully Dynamic Graph Streams

机译:完全动态图形流中的快速,准确和可证明的三角计数

获取原文
获取原文并翻译 | 示例
           

摘要

Given a stream of edge additions and deletions, how can we estimate the count of triangles in it? If we can store only a subset of the edges, how can we obtain unbiased estimates with small variances?Counting triangles (i.e., cliques of size three) in a graph is a classical problem with applications in a wide range of research areas, including social network analysis, data mining, and databases. Recently, streaming algorithms for triangle counting have been extensively studied since they can naturally be used for large dynamic graphs. However, existing algorithms cannot handle edge deletions or suffer from low accuracy.Can we handle edge deletions while achieving high accuracy? We propose ThinkD, which accurately estimates the counts of global triangles (i.e., all triangles) and local triangles associated with each node in a fully dynamic graph stream with additions and deletions of edges. Compared to its best competitors, ThinkD is (a) Accurate: up to 4.3x more accurate within the same memory budget, (b) Fast: up to 2.2x faster for the same accuracy requirements, and (c) Theoretically sound: always maintaining estimates with zero bias (i.e., the difference between the true triangle count and the expected value of its estimate) and small variance. As an application, we use ThinkD to detect suddenly emerging dense subgraphs, and we show its advantages over state-of-the-art methods.
机译:给定一系列边的添加和删除,我们如何估计其中的三角形数量?如果我们只能存储边缘的一个子集,那么如何获得方差很小的无偏估计呢?在图中计算三角形(即大小为3的团)是一个经典问题,在包括社会在内的广泛研究领域中都有应用网络分析,数据挖掘和数据库。最近,由于三角形计数的流算法自然可以用于大型动态图,因此已经对其进行了广泛的研究。但是,现有算法无法处理边缘删除或精度较低,能否在实现高精度的同时处理边缘删除?我们提出了ThinkD,它可以精确估计与完全动态图流中的每个节点相关联的全局三角形(即所有三角形)和局部三角形的数量,并带有边的添加和删除。与最佳竞争对手相比,ThinkD在以下方面具有以下优势:(a)准确:在相同的内存预算内准确度提高了4.3倍;(b)快速:对于相同的准确度要求,速度提高了2.2倍;以及(c)从理论上讲:始终保持零偏差(即,真实三角形计数与其估计的期望值之间的差)和小方差的估计。作为应用程序,我们使用ThinkD来检测突然出现的密集子图,并且我们展示了它与最新方法相比的优势。

著录项

  • 来源
    《ACM transactions on knowledge discovery from data》 |2020年第2期|12.1-12.39|共39页
  • 作者单位

    Korea Adv Inst Sci & Technol Grad Sch AI 291 Daehak Ro Daejeon 34141 South Korea|Korea Adv Inst Sci & Technol Sch Elect Engn 291 Daehak Ro Daejeon 34141 South Korea;

    Georgia Inst Technol Sch Computat Sci & Engn 756 W Peachtree St NW Atlanta GA 30308 USA;

    Inria Saclay Equipe DataShape 1 Rue Honore dEstienne dOrves F-91120 Palaiseau Ile De France France;

    Natl Univ Singapore Dept Comp Sci COM1 13 Comp Dr Singapore 117417 Singapore;

    Carnegie Mellon Univ Comp Sci Dept 5000 Forbes Ave Pittsburgh PA 15213 USA;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Triangle counting; local triangles; edge deletions;

    机译:三角计数;局部三角形边缘缺失;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号