首页> 外文会议>IAPR International Workshop on Document Analysis Systems >A New One-Class Classification Method Based on Symbolic Representation: Application to Document Classification
【24h】

A New One-Class Classification Method Based on Symbolic Representation: Application to Document Classification

机译:一种基于符号表示的新单类分类方法:应用于文档分类

获取原文

摘要

Training a system using a small number of instances to obtain accurate recognition/classification is a crucial need in document classification domain. The one-class classification is chosen since only positive samples are available for the training. In this paper, a new one-class classification method based on symbolic representation method is proposed. Initially a set of features is extracted from the training set. A set of intervals valued symbolic feature vector is then used to represent the class. Each interval value (symbolic data) is computed using mean and standard deviation of the corresponding feature values. To evaluate the proposed one-class classification method a dataset composed of 544 document images was used. Experiment results reveal that the proposed one-class classification method works well even when the number of training samples is small (≤10). Moreover, we noted that the proposed one-class classification method is suitable for document classification and provides better result compared to one-class k-nearest neighbor (k-NN) classifier.
机译:使用少量实例训练系统以获得准确的识别/分类是文档分类域中的重要需求。选择单级分类,因为只有正样品可用于培训。本文提出了一种基于符号表示方法的新的单级分类方法。最初从训练集中提取了一组特征。然后使用一组间隔值符号特征向量来表示类。使用相应特征值的均值和标准偏差来计算每个间隔值(符号数据)。为了评估所提出的单级分类方法,使用了由544个文档图像组成的数据集。实验结果表明,即使训练样本的数量小(&Le; 10),所提出的单级分类方法也适用于井。此外,我们指出,与单级K-最近邻(K-NN)分类器相比,所提出的单级分类方法适用于文档分类,并提供更好的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号