Discriminative learning of relaxed hierarchy for large-scale visual recognition

机译：松散等级的判别学习用于大规模视觉识别

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

In the real visual world, the number of categories a classifier needs to discriminate is on the order of hundreds or thousands. For example, the SUN dataset [24] contains 899 scene categories and ImageNet [6] has 15,589 synsets. Designing a multiclass classifier that is both accurate and fast at test time is an extremely important problem in both machine learning and computer vision communities. To achieve a good trade-off between accuracy and speed, we adopt the relaxed hierarchy structure from [15], where a set of binary classifiers are organized in a tree or DAG (directed acyclic graph) structure. At each node, classes are colored into positive and negative groups which are separated by a binary classifier while a subset of confusing classes is ignored. We color the classes and learn the induced binary classifier simultaneously using a unified and principled max-margin optimization. We provide an analysis on generalization error to justify our design. Our method has been tested on both Caltech-256 (object recognition) [9] and the SUN dataset (scene classification) [24], and shows significant improvement over existing methods.

机译：在现实的视觉世界中，分类器需要区分的类别数量约为数百或数千。例如，SUN数据集[24]包含899个场景类别，而ImageNet [6]具有15,589个同义词集。在机器学习和计算机视觉社区中，设计一种在测试时既准确又快速的多类分类器是一个非常重要的问题。为了在精度和速度之间取得良好的折衷，我们采用[15]中的宽松层次结构，其中一组二进制分类器以树或DAG（有向无环图）结构进行组织。在每个节点上，将类别分为正组和负组，它们由二进制分类器分隔，而混淆类的子集则被忽略。我们使用统一且原则上的最大边距优化为类着色并同时学习归纳二进制分类器。我们对泛化误差进行了分析，以证明我们的设计合理。我们的方法已经在Caltech-256（对象识别）[9]和SUN数据集（场景分类）[24]上进行了测试，并且显示出对现有方法的显着改进。

著录项

来源
《Computer Vision (ICCV), 2011 IEEE International Conference on》|2011年|p.2072-2079|共8页
会议地点 Barcelona(ES)
作者
Tianshi Gao; Koller Daphne;
展开▼
作者单位

Dept. of Electrical Engineering, Stanford University, USA;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类 TP391.41;
关键词

相似文献

外文文献
中文文献
专利

1. Jointly Learning Visually Correlated Dictionaries for Large-Scale Visual Recognition Applications [J] . IEEE Transactions on Pattern Analysis and Machine Intelligence . 2014,第4期

机译：联合学习用于大型视觉识别应用程序的视觉相关词典
2. Learning representation hierarchies by sharing visual features: a computational investigation of Persian character recognition with unsupervised deep learning [J] . Sadeghi Zahra, Testolin Alberto Cognitive processing . 2017,第3期

机译：通过分享视觉特征学习代表层次结构：与无监督深度学习的波斯字符识别的计算调查
3. Discriminative dictionary pair learning based on differentiable support vector function for visual recognition [J] . Boheng Chen, Jie Li, Biyun Ma, Neurocomputing . 2018,第jana10期

机译：基于差分支持向量函数的判别词典对学习
4. Discriminative learning of relaxed hierarchy for large-scale visual recognition [C] . Tianshi Gao, Koller Daphne International Conference on Computer Vision . 2011

机译：大规模视觉识别的轻松等级辨别性学习
5. Hierarchical learning of discriminative features and classifiers for large-scale visual recognition. [D] . Zhou, Ning. 2014

机译：用于大规模视觉识别的区分性特征和分类器的分层学习。
6. Towards a Robust Visual Place Recognition in Large-Scale vSLAM Scenarios Based on a Deep Distance Learning [O] . Liang Chen, Sheng Jin, Zhoujun Xia 2021

机译：基于深度远程学习的大规模VSLAM情景中的强大视觉识别
7. Embedding Visual Hierarchy with Deep Networks for Large-Scale Visual Recognition [O] . Zhao, Tianyi, Zhang, Baopeng, Zhang, Wei, 2017

机译：使用Deep Networks嵌入Visual Hierarchy进行大规模Visual 承认

Discriminative learning of relaxed hierarchy for large-scale visual recognition

摘要

著录项

相似文献

相关主题

期刊订阅