首页> 外文会议>International Conference on Parallel Processing >Massive Social Network Analysis: Mining Twitter for Social Good
【24h】

Massive Social Network Analysis: Mining Twitter for Social Good

机译:大规模的社交网络分析:社会良好的挖掘推特

获取原文

摘要

Social networks produce an enormous quantity of data. Facebook consists of over 400 million active users sharing over 5 billion pieces of information each month. Analyzing this vast quantity of unstructured data presents challenges for software and hardware. We present GraphCT, a Graph Characterization Toolkit for massive graphs representing social network data. On a 128-processor Cray XMT, GraphCT estimates the betweenness centrality of an artificially generated (R-MAT) 537 million vertex, 8.6 billion edge graph in 55 minutes and a real-world graph (Kwak, et al.) with 61.6 million vertices and 1.47 billion edges in 105 minutes. We use GraphCT to analyze public data from Twitter, a microblogging network. Twitter's message connections appear primarily tree-structured as a news dissemination system. Within the public data, however, are clusters of conversations. Using GraphCT, we can rank actors within these conversations and help analysts focus attention on a much smaller data subset.
机译:社交网络产生了巨大的数据。 Facebook由4亿有效的用户组成,每个月共用超过50亿件信息。分析这一大量非结构化数据为软件和硬件提供了挑战。我们呈现Graphict,一个用于表示社交网络数据的大规模图形的图表表征工具包。在128处理器Cray XMT上,Graphict估计人工产生的(R-MAT)537百万个顶点的中心中心,在55分钟内和一个真实世界的图表(Kwak等人)中有6160万顶点105分钟内有147亿边缘。我们使用Graphict来分析Twitter,微博网络的公共数据。 Twitter的消息连接主要显示为新闻传播系统。然而,在公共数据中,是对话集群。使用Graphict,我们可以在这些对话中排名演员,并帮助分析师将注意力集中在更小的数据子集中。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号