...
首页> 外文期刊>ACM SIGIR FORUM >Automatically Extracting High-Quƒality Negative Examples for Answer Selection in Quƒestion Answering
【24h】

Automatically Extracting High-Quƒality Negative Examples for Answer Selection in Quƒestion Answering

机译:自动提取高质否定样本,用于问题答案中的答案选择

获取原文
获取原文并翻译 | 示例
           

摘要

We propose a heuristic called “one answer per document” for automaticallyrnextracting high-quality negative examples for answerrnselection in question answering. Starting with a collection ofrnquestion–answer pairs from the popular TrecQA dataset, we identifyrnthe original documents from which the answers were drawn.rnSentences from these source documents that contain query termsrn(aside from the answers) are selected as negative examples. Trainingrnon the original data plus these negative examples yields improvementsrnin e‚ectiveness by a margin that is comparable to successivernrecent publications on this dataset. Our technique is completelyrnunsupervised, which means that the gains come essentially for free.rnWe con€rm that the improvements can be directly aŠributed to ourrnheuristic, as other approaches to extracting comparable amountsrnof training data are not e‚ective. Beyond the empirical validationrnof this heuristic, we also share our improved TrecQA dataset withrnthe community to support further work in answer selection.
机译:我们提出一种启发式的方法,即“每个文档一个答案”,用于自动提取高质量的否定示例,以便在问题回答中选择答案。从流行的TrecQA数据集中的问题-答案对的集合开始,我们确定从中提取答案的原始文档。这些源文档中包含查询词的句子(除了答案)被选为否定示例。 Trainernon原始数据加上这些否定示例可提高有效性,其有效性可与该数据集上的后续最新出版物相提并论。我们的技术是完全不受监督的,这意味着收益本质上是免费的。我们认为,可以直接将改进应用于我们的启发式方法,因为其他方法无法提取可比的数量的训练数据。除了这种启发式的经验验证之外,我们还与社区共享改进的TrecQA数据集,以支持答案选择方面的进一步工作。

著录项

  • 来源
    《ACM SIGIR FORUM》 |2017年第cd期|797-800|共4页
  • 作者单位

    David R. Cheriton School of Computer Science, University of Waterloo;

    Department of Computer Science, University of Maryland;

    David R. Cheriton School of Computer Science, University of Waterloo;

    Department of Management Sciences, University of Waterloo;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号