首页> 外文会议>Computer Vision (ICCV), 2011 IEEE International Conference on >Discriminative learning of relaxed hierarchy for large-scale visual recognition
【24h】

Discriminative learning of relaxed hierarchy for large-scale visual recognition

机译:松散等级的判别学习用于大规模视觉识别

获取原文
获取原文并翻译 | 示例

摘要

In the real visual world, the number of categories a classifier needs to discriminate is on the order of hundreds or thousands. For example, the SUN dataset [24] contains 899 scene categories and ImageNet [6] has 15,589 synsets. Designing a multiclass classifier that is both accurate and fast at test time is an extremely important problem in both machine learning and computer vision communities. To achieve a good trade-off between accuracy and speed, we adopt the relaxed hierarchy structure from [15], where a set of binary classifiers are organized in a tree or DAG (directed acyclic graph) structure. At each node, classes are colored into positive and negative groups which are separated by a binary classifier while a subset of confusing classes is ignored. We color the classes and learn the induced binary classifier simultaneously using a unified and principled max-margin optimization. We provide an analysis on generalization error to justify our design. Our method has been tested on both Caltech-256 (object recognition) [9] and the SUN dataset (scene classification) [24], and shows significant improvement over existing methods.
机译:在现实的视觉世界中,分类器需要区分的类别数量约为数百或数千。例如,SUN数据集[24]包含899个场景类别,而ImageNet [6]具有15,589个同义词集。在机器学习和计算机视觉社区中,设计一种在测试时既准确又快速的多类分类器是一个非常重要的问题。为了在精度和速度之间取得良好的折衷,我们采用[15]中的宽松层次结构,其中一组二进制分类器以树或DAG(有向无环图)结构进行组织。在每个节点上,将类别分为正组和负组,它们由二进制分类器分隔,而混淆类的子集则被忽略。我们使用统一且原则上的最大边距优化为类着色并同时学习归纳二进制分类器。我们对泛化误差进行了分析,以证明我们的设计合理。我们的方法已经在Caltech-256(对象识别)[9]和SUN数据集(场景分类)[24]上进行了测试,并且显示出对现有方法的显着改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号