首页> 外国专利> Distributed document search apparatus, distributed document search method, and distributed document search program

Distributed document search apparatus, distributed document search method, and distributed document search program

机译:分布式文件搜索设备,分布式文件搜索方法和分布式文件搜索程序

摘要

PROBLEM TO BE SOLVED: To provide a distributed document search device capable of reducing calculation load for searching by reducing the times of making reference to inverted index.SOLUTION: With resect to plural inverted indexes TIC-TIF, plural phrase signatures A-F, which record if two words appear as a phrase in documents charged to the inverted indexes, are disposed in a tree shaped structure. Plural proximity signature groups gA-gF, which record if two words appear being separated away from each other by a preset distance in documents charged by the inverted indexes, are disposed in a tree shaped structure. When searching a phrase, an iquiry if the two words appear as a phrase is made to upper phrase signatures A and B, and only when the two words appear, the iquiry is made to the lower signatures. When the two words do not appear, the iquiry is not made to thereby reduce the load for search processing.
机译:解决的问题:提供一种分布式文档搜索设备,该设备能够通过减少引用反向索引的次数来减少搜索的计算量。解决方案:对于多个反向索引TIC-TIF,多个短语签名AF记录了是否两个单词以短语的形式出现在短语中,并以树状结构排列。多个接近签名组gA-gF以树状结构布置,该组gA-gF记录两个单词是否出现在由倒排索引填充的文档中彼此隔开预定距离。当搜索短语时,查询两个单词是否作为短语出现在较高短语签名A和B上,仅当出现两个单词时,才查询较低签名。当两个词没有出现时,不进行查询,从而减轻了搜索处理的负担。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号