【24h】

Blogger clustering by utilizing link information

机译:利用链接信息进行Blogger集群

获取原文

摘要

Blogs are a new form of internet phenomenon and a vast ever-increasing information resource, which are dated unedited, highly opinionated personal online commentary including all kinds of hyperlinks such as citation link, comment link, blogroll link. These links can be viewed as the blogger's browse behavior, which reflects the user's interest to a certain extent. So we construct a blogger-post matrix, link analysis is considered in calculation of the entry of the matrix. With usage of probability latent semantic analysis, the conditional probability of latent variable Z to post P is transformed the the conditional probability of latent variable Z to post B, then the transformed results are used in similarity calculation. The k-medoids algorithm is adopted to further improve clustering result. Experiment results have shown that this new algorithm is effective
机译:博客是互联网现象的一种新形式,并且是一种不断增长的巨大信息资源,它的日期未经编辑,备受好评,个人在线评论,包括各种超链接,例如引文链接,评论链接,博客链接。这些链接可以视为博客作者的浏览行为,在一定程度上反映了用户的兴趣。因此,我们构造了一个博客发布者矩阵,在矩阵条目的计算中考虑了链接分析。利用概率潜在语义分析,将潜在变量Z的条件概率转换为后变量P的条件概率,然后将转换后的结果用于相似度计算。采用k-medoids算法进一步改善聚类结果。实验结果表明,该算法是有效的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号