【24h】

Learning to Rank with Groups

机译:学习与小组排名

获取原文

摘要

An essential issue in document retrieval is ranking, and the documents are ranked by their expected relevance to a given query. Multiple labels are used to represent different level of relevance for documents to a given query, and the corresponding label values are used to quantify the relevance of the documents. According to the training set for a given query, the documents can be divided into several groups. Specifically, the documents with the same label are assigned to the same group. If the documents in the group with higher relevance label can always be ranked higher over the ones in groups with lower relevance label by a ranking model, it is reasonable to expect perfect ranking performance. Inspired by this idea, we propose a novel framework for learning to rank, which depends on two new samples. The first one is one-group constituted by one document with higher level label and a group of documents with lower level label; the second one is group-group constituted by a group of documents with higher level label and a group of documents with lower level label. A novel loss function is proposed based on the likelihood loss similar to ListMLE. We demonstrate the advantages of our approaches on the Letor 3.0 data set. Experimental results show that our approaches are effective in improving the ranking performance.
机译:文档检索中的一个重要问题是排名,并且根据文档与给定查询的预期相关性对文档进行排名。多个标签用于表示文档与给定查询的不同相关级别,并且相应的标签值用于量化文档的相关性。根据给定查询的训练集,文档可以分为几组。具体来说,具有相同标签的文档将分配给相同的组。如果通过排名模型可以始终将具有较高相关性标签的组中的文档始终排在具有较低相关性标签的组中,则可以合理地期望获得理想的排名性能。受此想法的启发,我们提出了一个学习排名的新颖框架,该框架取决于两个新样本。第一个是一组,由一个带有较高级别标签的文档和一组带有较低级别标签的文档组成;第二组是由一组具有较高级别标签的文档和一组具有较低级别标签的文档组成的组-组。基于类似于ListMLE的似然损失,提出了一种新颖的损失函数。我们在Letor 3.0数据集上展示了我们的方法的优势。实验结果表明,我们的方法可以有效地提高排名效果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号