Conference: IEEE/WIC/ACM International Conference on Web Intelligence

Identifying Domain Experts in the Blogosphere -- Ranking Blogs Based on Topic Consistency

Abstract

Current ranking algorithms, such as PageRank, Technorati authority, and BI-Impact, favor blogs that report on a diversity of topics, since those attract a large audience and thus more visitors, links, and comments. Niche blogs with a very specific topic, on the other hand, attract only a small audience and thus have only a small reach, which results in a low ranking in today's blog retrieval systems. We argue that the consistency of a blog, i.e. how closely its author stays focused on a single topic, is a sign of expert knowledge. Finding these blogs is particularly important for other domain experts, who want to identify blogs to follow and stay in active contact with. To ease the retrieval of expert blogs, i.e. to separate them from the mass of blogs that report on random topics, we introduce a metric for blogs based on topic consistency. We divide the consistency ranking into four aspects: (1) intra-post, (2) inter-post, (3) intra-blog, and (4) inter-blog consistency. By evaluating the metric on a test data set of 12,000 crawled blogs, we demonstrate the plausibility of our approach.
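
The abstract does not say how any of the four consistency aspects is computed. Purely as an illustration of the idea, the sketch below scores one of them, inter-post consistency, as the mean pairwise cosine similarity between bag-of-words vectors of a blog's posts. The scoring formula, the whitespace tokenization, the function names (`term_vector`, `cosine`, `inter_post_consistency`), and the sample posts are all assumptions made for this example and are not taken from the paper.

```python
import math
from collections import Counter
from itertools import combinations


def term_vector(text):
    """Lowercased bag-of-words counts; a stand-in for real topic extraction."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def inter_post_consistency(posts):
    """Hypothetical score: mean pairwise cosine similarity across posts.

    Returns 0.0 when there are fewer than two posts to compare.
    """
    vectors = [term_vector(p) for p in posts]
    pairs = list(combinations(vectors, 2))
    if not pairs:
        return 0.0
    return sum(cosine(a, b) for a, b in pairs) / len(pairs)


# Made-up sample data: one blog that stays on topic, one that does not.
niche_blog = [
    "python web crawler tips",
    "python crawler performance",
    "fast python web crawler",
]
mixed_blog = [
    "python web crawler tips",
    "best pasta recipes",
    "football match report",
]

niche_score = inter_post_consistency(niche_blog)
mixed_score = inter_post_consistency(mixed_blog)
```

Under these assumptions the on-topic blog scores higher than the mixed one, which is the ordering a consistency-based ranking would want. A realistic system would likely replace raw word counts with weighted terms or extracted topics.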