首页> 外国专利> KEYWORD EXTRACTION METHOD AND KEYWORD EXTRACTION DEVICE FOR DOCUMENT DATABASE, COMPUTER PROGRAM AND PROGRAM STORAGE MEDIUM

KEYWORD EXTRACTION METHOD AND KEYWORD EXTRACTION DEVICE FOR DOCUMENT DATABASE, COMPUTER PROGRAM AND PROGRAM STORAGE MEDIUM

机译:文档数据库,计算机程序和程序存储介质的关键词提取方法和关键词提取装置

摘要

PROBLEM TO BE SOLVED: To precisely specify keywords characterizing each document, and to grasp the contents of each document at a glace in a document database in which a plurality of documents related with a specific field are summarized.;SOLUTION: This keyword extraction method in a document database is provided for making a programmed computer execute a step for acquiring the whole number m of terms included in a document database in which n pieces of documents related with a specific field are summarized and the respective terms Tj(j=1, 2, 3, ..., m), and for managing the identification of the respective terms Tj, a step for calculating appearance frequency Wij related with the terms Ti in a document Di by a predetermined calculation formula, a step for calculating distribution S2j of the appearance frequency Wij value concerning the terms Tj, a step for calculating significance Vij of the terms Tj in the document Di by Vij=Uij×S2j by using the appearance frequency of the terms Tj in the document Di as Uij and a step for preparing and outputting a term list in which the terms Tj are listed up based on the Vij.;COPYRIGHT: (C)2006,JPO&NCIPI
机译:解决的问题:精确指定每个文档的特征关键字,并在文档数据库中一目了然地掌握每个文档的内容,该文档数据库中汇总了与特定领域相关的多个文档。提供了一个文档数据库,用于使编程计算机执行一个步骤,该步骤用于获取文档数据库中包括的术语m的总数,其中汇总了与特定字段有关的n个文档,并且各个术语T j j = 1,2,3,...,m),并且为了管理各个术语T j 的标识,计算出现频率的步骤W ij 通过预定的计算公式与文档D i 中的术语T i 相关,这是计算分布S 2的步骤与项T j 有关的出现频率W ij 值的 j ,这是计算有效位数的步骤V ij = U ij中文件D i 中术语T j 的V ij × S 2 j ,方法是使用文档D i 中术语T j 的出现频率 U ij ,以及准备和输出术语列表的步骤,其中基于V ij 列出了术语T j 。;版权:(C)2006,JPO&NCIPI

著录项

  • 公开/公告号JP2006085374A

    专利类型

  • 公开/公告日2006-03-30

    原文格式PDF

  • 申请/专利权人 KEIO GIJUKU;

    申请/专利号JP20040268702

  • 申请日2004-09-15

  • 分类号G06F17/30;

  • 国家 JP

  • 入库时间 2022-08-21 21:53:19

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号