首页> 外文期刊>Computer Science & Information Technology >An Enhanced Lucene based System for Efficient Document/Information Retrieval
【24h】

An Enhanced Lucene based System for Efficient Document/Information Retrieval

机译:基于增强的Lucene基于高效文件/信息检索的系统

获取原文
           

摘要

In this paper we implement a document retrieval system using the Lucene tool and we conduct some experiments in order to compare the efficiency of two different weighting schema: the well-known TF-IDF and the BM25. Then, we expand queries using a comparable corpus (wikipedia) and word embeddings. Obtained results show that the latter method (word embeddings) is a good way to achieve higher precision rates and retrieve more accurate documents.
机译:在本文中,我们使用Lucene工具实施文档检索系统,我们进行了一些实验,以比较两个不同加权模式的效率:众所周知的TF-IDF和BM25。然后,我们使用可比较的语料库(维基百科)和W​​ord Embeddings展开查询。获得的结果表明,后一种方法(Word Embeddings)是实现更高的精度速率的好方法,并检索更准确的文档。

著录项

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号