An Enhanced Lucene based System for Efficient Document/Information Retrieval

Alaidine Ben Ayed; Isma?l Biskri; Jean-Guy Meunier

首页> 外文期刊>Computer Science & Information Technology >An Enhanced Lucene based System for Efficient Document/Information Retrieval

【24h】

An Enhanced Lucene based System for Efficient Document/Information Retrieval

机译：基于增强的Lucene基于高效文件/信息检索的系统

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相关主题

摘要

In this paper we implement a document retrieval system using the Lucene tool and we conduct some experiments in order to compare the efficiency of two different weighting schema: the well-known TF-IDF and the BM25. Then, we expand queries using a comparable corpus (wikipedia) and word embeddings. Obtained results show that the latter method (word embeddings) is a good way to achieve higher precision rates and retrieve more accurate documents.

机译：在本文中，我们使用Lucene工具实施文档检索系统，我们进行了一些实验，以比较两个不同加权模式的效率：众所周知的TF-IDF和BM25。然后，我们使用可比较的语料库（维基百科）和Word Embeddings展开查询。获得的结果表明，后一种方法（Word Embeddings）是实现更高的精度速率的好方法，并检索更准确的文档。

著录项

来源
《Computer Science & Information Technology》 |2020年第9期|共7页
作者
Alaidine Ben Ayed; Isma?l Biskri; Jean-Guy Meunier;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类计算技术、计算机技术;
关键词
Internet and Web ApplicationsData and knowledge RepresentationDocument Retrieval.;

机译：Internet和Web应用程序Data和知识代表Document Retrival。;

An Enhanced Lucene based System for Efficient Document/Information Retrieval

摘要

著录项

相关主题

期刊订阅