AUTOMATIC DOCUMENT CLASSIFICATION DEVICE, LEARNING DEVICE, CLASSIFICATION DEVICE, AUTOMATIC DOCUMENT CLASSIFICATION METHOD, LEARNING METHOD, CLASSIFICATION METHOD AND STORAGE MEDIUM

Abstract

PROBLEM TO BE SOLVED: To provide an automatic document classification device that can form a vector space in which topics are accurately reflected and that can classify documents appropriately.

SOLUTION: The automatic document classification device selects valid words from a learning document (valid word selection part 103). The number of valid words contained in each paragraph is obtained by referring to the learning document and the valid words (intra-paragraph valid word number calculation part 105). The intra-paragraph cooccurrence frequency of each group of valid words is obtained from these intra-paragraph valid word counts (intra-paragraph cooccurrence calculation part 107). A valid word vector is obtained for each valid word from the intra-paragraph cooccurrence frequencies, and document vectors are obtained for the learning document and for the document to be classified by referring to the valid word vectors. The folder vector of each category, which is obtained from the document vectors of the learning documents, is compared with the document vector of the document to be classified. The category to which that document belongs is decided according to the result of the comparison.

COPYRIGHT: (C)1999,JPO
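The flow described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the abstract does not state how valid words are chosen, how paragraphs are delimited, how cooccurrence is weighted, or how vectors are compared. The sketch assumes a simple frequency threshold (`min_count`), blank-line paragraph breaks, a product-of-counts cooccurrence weight, and cosine similarity. All function names are hypothetical.

```python
import math
from collections import Counter


def tokenize(text):
    # Assumption: lowercase whitespace tokenization stands in for word extraction.
    return text.lower().split()


def paragraphs(doc):
    # Assumption: paragraphs are separated by blank lines.
    return [p for p in doc.split("\n\n") if p.strip()]


def select_valid_words(docs, min_count=2):
    # Hypothetical criterion: keep words seen at least min_count times overall.
    counts = Counter(w for d in docs for w in tokenize(d))
    return sorted(w for w, c in counts.items() if c >= min_count)


def cooccurrence(docs, valid):
    # Count valid words per paragraph, then accumulate the product of counts
    # for every pair of valid words in that paragraph (self-pairs included).
    index = {w: i for i, w in enumerate(valid)}
    matrix = [[0.0] * len(valid) for _ in valid]
    for d in docs:
        for p in paragraphs(d):
            counts = Counter(w for w in tokenize(p) if w in index)
            for a, ca in counts.items():
                for b, cb in counts.items():
                    matrix[index[a]][index[b]] += ca * cb
    return index, matrix


def normalize(vec):
    norm = math.sqrt(sum(v * v for v in vec))
    return [v / norm for v in vec] if norm else vec


def document_vector(doc, index, matrix):
    # Assumption: a document vector is the normalized sum of the valid word
    # vectors (cooccurrence rows) of the valid words it contains.
    vec = [0.0] * len(index)
    for w in tokenize(doc):
        if w in index:
            vec = [v + r for v, r in zip(vec, matrix[index[w]])]
    return normalize(vec)


def folder_vectors(labeled_docs, index, matrix):
    # Assumption: a folder vector is the normalized sum of the document
    # vectors of the learning documents in that category.
    sums = {}
    for doc, category in labeled_docs:
        dv = document_vector(doc, index, matrix)
        prev = sums.get(category, [0.0] * len(index))
        sums[category] = [a + b for a, b in zip(prev, dv)]
    return {c: normalize(v) for c, v in sums.items()}


def classify(doc, folders, index, matrix):
    # Pick the category whose folder vector has the highest cosine similarity.
    dv = document_vector(doc, index, matrix)
    return max(folders, key=lambda c: sum(a * b for a, b in zip(dv, folders[c])))


# Toy learning documents (made up for illustration).
training = [
    ("team wins match\n\nteam scores goal match", "sports"),
    ("bank raises rate\n\nbank cuts rate", "finance"),
]
texts = [doc for doc, _ in training]
valid_words = select_valid_words(texts)
word_index, cooc = cooccurrence(texts, valid_words)
folders = folder_vectors(training, word_index, cooc)
print(classify("match report team", folders, word_index, cooc))
```

Because the word vectors are built from paragraph-level cooccurrence, words that tend to appear in the same paragraphs end up with similar vectors, which is one plausible reading of how the vector space comes to reflect topics.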