首页> 外国专利> String similarity calculation unit, string similarity calculation program, the string similarity calculation method and a computer-readable recording medium recorded it

String similarity calculation unit, string similarity calculation program, the string similarity calculation method and a computer-readable recording medium recorded it

机译:字符串相似度计算单元,字符串相似度计算程序,字符串相似度计算方法和记录该字符串的计算机可读记录介质

摘要

PROBLEM TO BE SOLVED: To speed up document retrieval by selecting a partial character string used for similarity calculation.;SOLUTION: An input character string X and a document Y in a document database are regarded as two character strings and their similarity is calculated. Partial character strings cut out of the input character string are sorted according to their appearance frequencies and recorded in a partial character string management table. Then matching information is gathered as to the respective partial character strings in the partial character string management table and recorded in a matching information management table. A list regarding the document Y is taken out of the table and the similarity to the input character string X is calculated. The document number and the similarity are recorded in a pair in a document management table. Those processes are repeated for all documents. Lastly, the document management table is rearranged in the decreasing order of the similarity and a document having high similarity is selected as a retrieval result from the database.;COPYRIGHT: (C)2002,JPO
机译:解决的问题:通过选择用于相似度计算的部分字符串来加快文档检索。;解决方案:将文档数据库中的输入字符串X和文档Y视为两个字符串,并计算它们的相似度。从输入字符串中切出的部分字符串根据其出现频率进行排序,并记录在部分字符串管理表中。然后,在部分字符串管理表中收集关于各个部分字符串的匹配信息,并将其记录在匹配信息管理表中。从表中取出关于文档Y的列表,并计算与输入字符串X的相似度。文档编号和相似性成对记录在文档管理表中。对所有文档重复这些过程。最后,以相似度从高到低的顺序重新排列文档管理表,并从数据库中选择具有高相似度的文档作为检索结果。版权所有:(C)2002,JPO

著录项

  • 公开/公告号JP4065695B2

    专利类型

  • 公开/公告日2008-03-26

    原文格式PDF

  • 申请/专利权人 住友電気工業株式会社;

    申请/专利号JP20020012259

  • 发明设计人 梅村 恭司;

    申请日2002-01-22

  • 分类号G06F17/30;

  • 国家 JP

  • 入库时间 2022-08-21 20:18:26

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号