...
首页> 外文期刊>ACM Transactions on Information Systems >Boosting Search Performance Using Query Variations
【24h】

Boosting Search Performance Using Query Variations

机译:使用查询变体来提高搜索性能

获取原文
获取原文并翻译 | 示例
           

摘要

Rank fusion is a powerful technique that allows multiple sources of information to be combined into a single result set. Query variations covering the same information need represent one way in which different sources of information might arise. However, when implemented in the obvious manner, fusion over query variations is not cost-effective, at odds with the usual web-search requirement for strict per-query efficiency guarantees. In this work, we propose a novel solution to query fusion by splitting the computation into two parts: one phase that is carried out offline, to generate pre-computed centroid answers for queries addressing broadly similar information needs, and then a second online phase that uses the corresponding topic centroid to compute a result page for each query. To achieve this, we make use of score-based fusion algorithms whose costs can be amortized via the pre-processing step and that can then be efficiently combined during subsequent per-query re-ranking operations. Experimental results using the ClueWeb12B collection and the UQV100 query variations demonstrate that centroid-based approaches allow improved retrieval effectiveness at little or no loss in query throughput or latency and within reasonable pre-processing requirements. We additionally show that queries that do not match any of the pre-computed clusters can be accurately identified and efficiently processed in our proposed ranking pipeline.
机译:等级融合是一种强大的技术,它允许将多个信息源组合到一个结果集中。涵盖相同信息需求的查询变体表示可能出现不同信息源的一种方式。但是,当以明显的方式实施时,对查询变体的融合并不具有成本效益,这与通常的网络搜索要求严格的每次查询效率保证相悖。在这项工作中,我们提出了一种新的查询融合解决方案,方法是将计算分为两部分:一个阶段是离线进行的,以便为满足广泛相似信息需求的查询生成预先计算的质心答案,然后是另一个在线阶段,使用相应的主题质心为每个查询计算结果页。为了实现这一点,我们使用了基于分数的融合算法,这些算法的成本可以通过预处理步骤进行摊销,然后可以在随后的每次查询重新排序操作中有效地合并。使用ClueWeb12B集合和UQV100查询变体进行的实验结果表明,基于质心的方法可以提高查询效率,而查询吞吐量或等待时间几乎没有损失,也可以在合理的预处理要求范围内进行。我们还显示,在我们建议的排名管道中,可以准确地识别并有效地处理与任何预先计算的集群都不匹配的查询。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号