首页> 外文会议>Asian conference on computer vision >Spatially Local Coding for Object Recognition
【24h】

Spatially Local Coding for Object Recognition

机译:用于对象识别的空间本地编码

获取原文

摘要

The spatial pyramid and its variants have been among the most popular and successful models for object recognition. In these models, local visual features are coded across elements of a visual vocabulary, and then these codes are pooled into histograms at several spatial granularities. We introduce spatially local coding, an alternative way to include spatial information in the image model. Instead of only coding visual appearance and leaving the spatial coherence to be represented by the pooling stage, we include location as part of the coding step. This is a more flexible spatial representation as compared to the fixed grids used in the spatial pyramid models and we can use a simple, whole-image region during the pooling stage. We demonstrate that combining features with multiple levels of spatial locality performs better than using just a single level. Our model performs better than all previous single-feature methods when tested on the Caltech 101 and 256 object recognition datasets.
机译:空间金字塔及其变体是对象识别最受欢迎和成功的模型之一。在这些模型中,局部可视特征在视觉词汇的元素中编码,然后将这些代码汇集到几个空间粒度的直方图中。我们在空间局部编码,一种包括在图像模型中的空间信息的替代方法。而不是仅编码视觉外观并将空间相干性留给池级表示,而不是作为编码步骤的一部分的位置。与空间金字塔模型中使用的固定网格相比,这是一种更灵活的空间表示,并且我们可以在汇集阶段使用简单的整个图像区域。我们证明,具有多个级别空间局部的功能比仅使用单个级别更好地表现出更好的特征。当在CALTECH 101和256对象识别数据集上测试时,我们的模型比以前的所有单一特征方法更好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号