【24h】

MFCRank: A Web Ranking Algorithm Based on Correlation of Multiple Features

机译:MFCRank:一种基于多个特征相关性的Web排名算法

获取原文
获取原文并翻译 | 示例

摘要

This paper presents a new ranking algorithm MFCRank for topic-specific Web search systems. The basic idea is to correlate two types of similarity information into a unified link analysis model so that the rich content and link features in Web collections can be exploited efficiently to improve the ranking performance. First, a new surfer model JBC is proposed, under which the topic similarity information among neighborhood pages is used to weigh the jumping probability of the surfer and to direct the surfing activities. Secondly, as JBC surfer model is still query-independent, a correlation between the query and JBC is essential. This is implemented by the definition of MFCRank score, which is the linear combination of JBC score and the similarity value between the query and the matched pages. Through the two correlation steps, the features contained in the plain text, link structure, anchor text and user query can be smoothly correlated in one single ranking model. Ranking experiments have been carried out on a set of topic-specific Web page collections. Experimental results showed that our algorithm gained great improvement with regard to the ranking precision.
机译:本文提出了一种针对主题特定的Web搜索系统的新排名算法MFCRank。基本思想是将两种类型的相似性信息关联到统一的链接分析模型中,以便可以有效利用Web集合中的丰富内容和链接功能,从而提高排名性能。首先,提出了一种新的冲浪者模型JBC,在该模型中,邻域页面之间的主题相似度信息用于权衡冲浪者的跳跃概率并指导冲浪活动。其次,由于JBC冲浪者模型仍然与查询无关,因此查询和JBC之间的相关性至关重要。这是通过MFCRank分数的定义实现的,该分数是JBC分数和查询与匹配页面之间的相似度值的线性组合。通过两个相关步骤,可以在一个单一的排名模型中平滑地关联纯文本,链接结构,锚文本和用户查询中包含的功能。已经对一组特定于主题的网页集进行了排名实验。实验结果表明,该算法在排序精度上有了很大的提高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号