...
【24h】

Social network sampling using spanning trees

机译:使用生成树的社交网络采样

获取原文
获取原文并翻译 | 示例
           

摘要

Due to the large scales and limitations in accessing most online social networks, it is hard or infeasible to directly access them in a reasonable amount of time for studying and analysis. Hence, network sampling has emerged as a suitable technique to study and analyze real networks. The main goal of sampling online social networks is constructing a small scale sampled network which preserves the most important properties of the original network. In this paper, we propose two sampling algorithms for sampling online social networks using spanning trees. The first proposed sampling algorithm finds several spanning trees from randomly chosen starting nodes; then the edges in these spanning trees are ranked according to the number of times that each edge has appeared in the set of found spanning trees in the given network. The sampled network is then constructed as a sub-graph of the original network which contains a fraction of nodes that are incident on highly ranked edges. In order to avoid traversing the entire network, the second sampling algorithm is proposed using partial spanning trees. The second sampling algorithm is similar to the first algorithm except that it uses partial spanning trees. Several experiments are conducted to examine the performance of the proposed sampling algorithms on well-known real networks. The obtained results in comparison with other popular sampling methods demonstrate the efficiency of the proposed sampling algorithms in terms of Kolmogorov-Smirnov distance (KSD), skew divergence distance (SDD) and normalized distance (ND).
机译:由于访问大多数在线社交网络的规模和限制,很难或不可行在合理的时间内直接访问它们进行研究和分析。因此,网络采样已经成为研究和分析真实网络的合适技术。对在线社交网络进行采样的主要目标是构建一个保留原始网络最重要属性的小规模采样网络。在本文中,我们提出了两种使用生成树对在线社交网络进行采样的采样算法。首先提出的采样算法从随机选择的起始节点中找到几棵生成树。然后根据给定网络中找到的生成树集合中每个边缘出现的次数对这些生成树中的边缘进行排名。然后将采样的网络构造为原始网络的子图,该子图包含入射在高度排序的边缘上的一部分节点。为了避免遍历整个网络,提出了使用部分生成树的第二种采样算法。除了使用部分生成树之外,第二种采样算法与第一种算法相似。进行了一些实验,以检查所提出的采样算法在众所周知的真实网络上的性能。与其他流行的采样方法相比,所获得的结果证明了所提出的采样算法在Kolmogorov-Smirnov距离(KSD),偏斜发散距离(SDD)和归一化距离(ND)方面的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号