首页> 外文会议>IEEE/CVF Conference on Computer Vision and Pattern Recognition >Multi-scale Location-Aware Kernel Representation for Object Detection
【24h】

Multi-scale Location-Aware Kernel Representation for Object Detection

机译:用于对象检测的多尺度位置感知内核表示

获取原文

摘要

Although Faster R-CNN and its variants have shown promising performance in object detection, they only exploit simple first-order representation of object proposals for final classification and regression. Recent classification methods demonstrate that the integration of high-order statistics into deep convolutional neural networks can achieve impressive improvement, but their goal is to model whole images by discarding location information so that they cannot be directly adopted to object detection. In this paper, we make an attempt to exploit high-order statistics in object detection, aiming at generating more discriminative representations for proposals to enhance the performance of detectors. To this end, we propose a novel Multi-scale Location-aware Kernel Representation (MLKP) to capture high-order statistics of deep features in proposals. Our MLKP can be efficiently computed on a modified multi-scale feature map using a low-dimensional polynomial kernel approximation. Moreover, different from existing orderless global representations based on high-order statistics, our proposed MLKP is location retentive and sensitive so that it can be flexibly adopted to object detection. Through integrating into Faster R-CNN schema, the proposed MLKP achieves very competitive performance with state-of-the-art methods, and improves Faster R-CNN by 4.9% (mAP), 4.7% (mAP) and 5.0% (AP at IOU=[0.5:0.05:0.95]) on PASCAL VOC 2007, VOC 2012 and MS COCO benchmarks, respectively. Code is available at: https://github.com/Hwang64/MLKP.
机译:虽然R-CNN及其变体更快地显示了对象检测中的有希望的性能,但它们仅利用对象提案的简单一阶表示,以获得最终分类和回归。最近的分类方法表明,高阶统计到深度卷积神经网络的集成可以实现令人印象深刻的改进,但它们的目标是通过丢弃位置信息来模拟整个图像,使得它们不能直接采用对象检测。在本文中,我们试图利用对象检测中的高阶统计数据,旨在为提高探测器的性能产生更多辨别性表示。为此,我们提出了一种新的多尺度位置感知内核表示(MLKP),以捕获提案中的深度特征的高阶统计信息。我们的MLKP可以使用低维多项式内核近似在修改的多尺度特征图上有效地计算。此外,与基于高阶统计数据的现有有序全局表示不同,我们提出的MLKP是定位保持和敏感,以便可以灵活地采用对象检测。通过整合到更快的R-CNN架构方案,所提出的MLKP通过最先进的方法实现了非常竞争力的性能,并提高了4.9%(地图),4.7%(地图)和5.0%的速度r-CNN(AP iou = [0.5:0.05:0.95])分别在Pascal VOC,VOC 2012和MS Coco基准。代码可用:https://github.com/hwang64/mlkp。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号