首页> 外国专利> Searching and identifying of stored documents by associating score values with search results and ranking identified documents based on a weighted score value

Searching and identifying of stored documents by associating score values with search results and ranking identified documents based on a weighted score value

机译:通过将得分值与搜索结果相关联并基于加权得分值对识别出的文档进行排名,从而搜索和识别存储的文档

摘要

595709 Disclosed is a computer system (200) with a memory and at least one processor for facilitating a search and identification of documents stored in a memory device. The system (200) includes a processor which receives a query with a set of search terms, identifies an initial result set of stored documents that are relevant to the query, and identifies a document from the initial result set. An information retrieval (IR) score generating component (206) determines a score value representing a degree of similarity between the identified document and the query. The score value is based on a similarity between one of the query search terms and metadata describing subject matter of the identified document. Also included is a citation network of baseline query results which has a first set of documents that cite to the identified document and a second set of documents cited to by the identified document. A citation component (208) locates the identified document in the citation network, and determines a new score value of the identified document as a function of the score value and a quantity and a quality of documents within the first and second set of documents. An activity score component (210) locates documents in a subject matter community of the identified document, outside of the citation network, that refer to the identified document. The activity score component (210) calculates an activity score value of the identified document based at least in part on a number of times the identified document is referred to in the subject matter community and weighs the new score value with the activity score. A display device displays a report reflecting a ranking of the identified document based on the weighted new score value.
机译:595709公开了一种计算机系统(200),其具有存储器和至少一个处理器,用于促进对存储在存储设备中的文档的搜索和识别。系统(200)包括处理器,该处理器接收带有一组搜索项的查询,识别与该查询相关的已存储文档的初始结果集,并从初始结果集中识别文档。信息检索(IR)分数生成组件(206)确定代表所识别的文档和查询之间的相似度的分数值。得分值基于查询搜索词之一与描述所标识文档主题的元数据之间的相似性。还包括基线查询结果的引用网络,该网络具有引用已标识文档的第一组文档和已标识文档引用的第二组文档。引用组件(208)在引用网络中定位所识别的文档,并根据得分值以及第一和第二组文档内的文档的数量和质量来确定所识别文档的新得分值。活动评分组件(210)在引用网络之外的已标识文档的主题社区中定位引用已标识文档的文档。活动分数组件(210)至少部分地基于在主题社区中参考所标识的文档的次数来计算所标识的文档的活动分数值,并用活动分数对新分数值进行加权。显示设备基于加权的新得分值显示反映所识别文档的等级的报告。

著录项

  • 公开/公告号NZ595709A

    专利类型

  • 公开/公告日2013-06-28

    原文格式PDF

  • 申请/专利权人 LEXISNEXIS;

    申请/专利号NZ20100595709

  • 发明设计人 ZHANG LING QIN;SILVER HARRY R;

    申请日2010-03-30

  • 分类号G06F7;

  • 国家 NZ

  • 入库时间 2022-08-21 16:40:30

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号