首页> 外文会议>Workshop of the Cross-Language Evaluation Forum >Assessing the Impact of Thesaurus-Based Expansion Techniques in QA-Centric IR
【24h】

Assessing the Impact of Thesaurus-Based Expansion Techniques in QA-Centric IR

机译:评估基于词库的扩展技术在质朴中心IR中的影响

获取原文

摘要

We study the impact of using thesaurus-based query expansion methods at the Information Retrieval (IR) stage of a Question Answering (QA) system. We focus on expanding queries for questions regarding actions and events, where verbs have a central role. Two different thesaurus are used: the OpenOffice thesaurus and an automatically generated verb thesaurus, The performance of thesaurus-based methods is compared against what is obtained by (i) executing no expansion and (ii) applying a simple query generalization method. Results show that thesaurus-based approaches help improving recall at retrieval, while keeping satisfactory precision. However, we confirm that positive impact for the final QA performance is mostly achieved due to increase in recall, which can also be obtained by using simpler methods. Nevertheless, because of its better relative precision thesaurus-based expansion is effective in selectively reducing the number of irrelevant text passages retrieved, thus reducing computational load in the answer extraction stage.
机译:我们研究使用基于词库的查询扩展方法在问题回答(QA)系统的信息检索(IR)阶段的影响。我们专注于扩大关于行动和事件的问题的查询,动词具有核心作用。使用两种不同的词库:OpenOffice同义词库和自动生成的动词词库,基于词库的方法的性能与通过(i)执行没有扩展和(ii)应用简单查询概括方法的方法进行比较。结果表明,基于词库的方法有助于改善检索召回,同时保持令人满意的精度。但是,我们确认由于召回的增加,我们最终达到了最终质量QA性能的积极影响,这也可以通过使用更简单的方法来获得。然而,由于其更好的相对精确的基于精确的基于词库的扩展,可以有效地选择性地减少所检索的无关文本段落的数量,从而减少了答案提取阶段的计算负荷。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号