
Mathematical Description of Inverted Index File


Abstract

Inverted index files are widely used in current information retrieval systems. In an inverted index file, records of document numbers and other information are stored one by one. This storage strategy depends heavily on large RAM and on disk I/O; the latter in particular easily becomes the bottleneck. In this paper, a new mathematical description of index files is presented to reduce storage demand and speed up the retrieval procedure. We carried out experiments to test our method. The experimental results show that the method can reduce the storage required for frequently appearing words and improve retrieval speed at the same time. Stop words also do not need to be filtered out.
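
For readers unfamiliar with the conventional layout the abstract refers to, the following is a minimal sketch of a plain inverted index in which each term keeps its postings (document number plus term frequency) one by one. It is not the mathematical description proposed in the paper, which the abstract does not detail; the function names `build_inverted_index` and `search` and the sample documents are hypothetical and chosen only for illustration.

```python
from collections import defaultdict


def build_inverted_index(docs):
    """Map each term to a list of (doc_id, term_frequency) postings.

    `docs` is a dict of doc_id -> text. Tokenization is a naive
    lowercase whitespace split (an assumption made for this sketch).
    """
    index = defaultdict(list)
    for doc_id in sorted(docs):
        counts = {}
        for term in docs[doc_id].lower().split():
            counts[term] = counts.get(term, 0) + 1
        # One posting per (term, document) pair, appended in doc_id order.
        for term, tf in counts.items():
            index[term].append((doc_id, tf))
    return dict(index)


def search(index, term):
    """Return the document numbers whose text contains `term`."""
    return [doc_id for doc_id, _ in index.get(term.lower(), [])]


# Hypothetical sample collection.
docs = {
    1: "the index file",
    2: "the inverted index",
    3: "the the retrieval",
}
index = build_inverted_index(docs)
```

In this sketch a frequent word such as "the" accumulates one posting per document that contains it, which illustrates why frequently appearing words dominate storage and I/O in the conventional layout.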
