...
首页> 外文期刊>International Journal of Web Based Communities >Community discovery algorithm under big data: taking microblog as an example
【24h】

Community discovery algorithm under big data: taking microblog as an example

机译:大数据下的社区发现算法:以微博为例

获取原文
获取原文并翻译 | 示例
           

摘要

Microblog has become a popular social media because of its short text and timely release, and its impact on society has gradually increased. In order to study the behaviour of microblog users, this paper introduced two algorithms for microblog network community division, which were community discovery algorithm based on density peak clustering and community discovery algorithm based on similarity. Then the two algorithms were simulated using MATLAB software. The data used in the experiment included the artificial network generated by LFR tool and the following information data of different users collected by taking the API interface of the microblog of a student from Computer School of Jining Normal University as the starting point by crawlers. The results demonstrated that the normalised mutual information (NMI) and the density of the community structure obtained by the two algorithms decreased, and the conductivity increased with the expansion of the scale of microblog network, and the community structure obtained by the similarity-based algorithm had higher NMI and density and lower conductivity under the same scale of micrblog network. In conclusion, the similarity-based algorithm can divide microblog network better.
机译:由于其短文本和及时释放,微博已成为一个受欢迎的社交媒体,而其对社会的影响逐渐增加。为了研究微博用户的行为,本文介绍了两种微博网络社区算法,该算法是基于相似性的密度峰集聚类和社区发现算法的社区发现算法。然后使用MATLAB软件模拟两种算法。实验中使用的数据包括由LFR工具生成的人造网络以及通过从济宁师范大学计算机学校的学生的MicroBlog的API接口作为爬行者的起点来收集不同用户的下列信息数据。结果表明,由两种算法获得的归一化互信息(NMI)和由两种算法获得的群落结构的密度随着微博网络的规模而增加,并且通过基于相似性的算法获得的社区结构增加在相同的Micrblog网络中具有更高的NMI和密度和较低的导电性。总之,基于相似性的算法可以更好地划分微博网络。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号