String similarity calculation unit, string similarity calculation program, string similarity calculation method, and computer-readable recording medium on which the program is recorded
PROBLEM TO BE SOLVED: To speed up document retrieval by selecting the partial character strings used for similarity calculation.

SOLUTION: An input character string X and a document Y in a document database are treated as two character strings, and their similarity is calculated. Partial character strings cut out of the input character string are sorted by appearance frequency and recorded in a partial character string management table. Matching information is then gathered for each partial character string in that table and recorded in a matching information management table. The list for document Y is taken out of the matching information management table, and the similarity to the input character string X is calculated. The document number and the similarity are recorded as a pair in a document management table. These steps are repeated for all documents. Finally, the document management table is sorted in decreasing order of similarity, and documents with high similarity are selected from the database as the retrieval result.

COPYRIGHT: (C)2002,JPO
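The flow described in the abstract can be sketched roughly as follows. This is an illustrative sketch only, not the patented method: the abstract does not state how partial character strings are cut, which frequency is used, in which direction they are sorted, or how similarity is computed. The sketch assumes fixed-length character n-grams, document frequency within the database, rarest-first ordering, and similarity defined as the fraction of selected substrings found in a document. The names `ngrams`, `rank_documents`, `n`, and `keep` are hypothetical.

```python
from collections import Counter


def ngrams(text, n=2):
    """Cut every length-n partial character string out of text (assumed cutting rule)."""
    return [text[i:i + n] for i in range(len(text) - n + 1)]


def rank_documents(query, documents, n=2, keep=None):
    """Return (document number, similarity) pairs in decreasing order of similarity."""
    # Appearance frequency: number of documents containing each substring (assumption).
    doc_freq = Counter()
    for doc in documents:
        doc_freq.update(set(ngrams(doc, n)))

    # Partial character string management table: distinct substrings of the
    # query, sorted by frequency. Rarest-first is an assumption.
    substrings = sorted(set(ngrams(query, n)), key=lambda s: (doc_freq[s], s))
    if keep is not None:
        # Selecting only some substrings reduces the matching work.
        substrings = substrings[:keep]

    # Matching information management table: document number -> matched substrings.
    matches = {}
    for s in substrings:
        for doc_no, doc in enumerate(documents):
            if s in doc:
                matches.setdefault(doc_no, []).append(s)

    # Document management table: one (document number, similarity) pair per document.
    table = []
    for doc_no in range(len(documents)):
        matched = matches.get(doc_no, [])
        # Assumed similarity: share of the selected substrings found in the document.
        similarity = len(matched) / len(substrings) if substrings else 0.0
        table.append((doc_no, similarity))

    # Rearrange in decreasing order of similarity.
    table.sort(key=lambda pair: pair[1], reverse=True)
    return table


if __name__ == "__main__":
    docs = ["string similarity", "document retrieval", "similar strings"]
    for doc_no, similarity in rank_documents("similarity", docs):
        print(doc_no, round(similarity, 3))
```

Passing a small `keep` value restricts matching to a few substrings, which is one plausible reading of how selecting partial character strings could speed up retrieval.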