...
首页> 外文期刊>Image and Vision Computing >Fine-grained categorization via CNN-based automatic extraction and integration of object-level and part-level features
【24h】

Fine-grained categorization via CNN-based automatic extraction and integration of object-level and part-level features

机译:通过基于CNN的自动提取以及对象级和零件级特征的集成进行细分类

获取原文
获取原文并翻译 | 示例
           

摘要

Fine-grained categorization can benefit from part-based features which reveal subtle visual differences between object categories. Handcrafted features have been widely used for part detection and classification. Although a recent trend seeks to learn such features automatically using powerful deep learning models such as convolutional neural networks (CNN), their training and possibly also testing require manually provided annotations which are costly to obtain. To relax these requirements, we assume in this study a general problem setting in which the raw images are only provided with object-level class labels for model training with no other side information needed. Specifically, by extracting and interpreting the hierarchical hidden layer features learned by a CNN, we propose an elaborate CNN-based system for fine-grained categorization. When evaluated on the Caltech-UCSD Birds-200-2011, FGVC-Aircraft, cars and Stanford dogs datasets under the setting that only object-level class labels are used for training and no other annotations are available for both training and testing, our method achieves impressive performance that is superior or comparable to the state of the art. Moreover, it sheds some light on ingenious use of the hierarchical features learned by CNN which has wide applicability well beyond the current fine-grained categorization task. (C) 2017 Elsevier B.V. All rights reserved.
机译:细分类可以受益于基于零件的功能,这些功能揭示了对象类别之间的细微视觉差异。手工制作的特征已广泛用于零件检测和分类。尽管最近的趋势是寻求使用强大的深度学习模型(例如卷积神经网络(CNN))自动学习这些功能,但是它们的训练以及可能的测试也需要手动提供的注释,而这些注释的获取成本很高。为了放宽这些要求,我们在本研究中假设一个一般性的问题设置,其中原始图像仅提供对象级类标签以进行模型训练,而无需其他辅助信息。具体来说,通过提取和解释CNN所学习的分层隐藏层特征,我们提出了一种基于精细CNN的精细分类系统。在Caltech-UCSD Birds-200-2011,FGVC-飞机,汽车和斯坦福犬数据集上进行评估时,我们的方法是仅使用对象级类标签进行训练,而没有其他注释可用于训练和测试。达到令人印象深刻的性能,该性能优于或可比。此外,它还揭示了CNN所学的分层功能的巧妙使用,该功能具有广泛的适用性,远远超出了当前的细粒度分类任务。 (C)2017 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号