首页> 外文期刊>Pattern recognition letters >Semantic segmentation of RGBD images based on deep depth regression
【24h】

Semantic segmentation of RGBD images based on deep depth regression

机译:基于深度深度回归的RGBD图像语义分割

获取原文
获取原文并翻译 | 示例
       

摘要

Depth information has been discovered to improve the performance of computer vision tasks, such as semantic segmentation and object recognition. However, careful acquisition of depth data needs highly developed depth sensors which are expensive. As a classic computer vision task, depth estimation from a single image has obtained promising results based on supervised learning methods. In this paper, we investigate the extension of color images with corresponding deep-regressed depth images in boosting the performance of semantic segmentation. Furthermore, the usage of combining color channels with the estimated depth or the ground truth depth channel is compared. Specifically, there are two stages in our work. Firstly, we adopt the framework of convolutional neural networks (CNN) for the depth estimation by combing the global depth network and the depth gradient network. After refining based on these two networks, the depth image map can be estimated in a deep-regressed manner. Secondly, after augmenting the color images with the predicted depth images, fully convolutional networks (FCN) are further used to implement the pixel-level semantic labeling. In the experiments, we employ two popular RGBD datasets, i.e., SUNRGBD and NYUDv2, for 37 and 40-class semantic segmentation, respectively. By comparing with the ground truth depth images, experimental results demonstrate that the networks trained on the estimated depth images can achieve comparable performance on facilitating the accuracy of semantic segmentation task. (C) 2017 Elsevier B.V. All rights reserved.
机译:已经发现深度信息可以改善计算机视觉任务的性能,例如语义分割和对象识别。然而,仔细获取深度数据需要高度开发的深度传感器,其昂贵。作为经典的计算机视觉任务,基于监督学习方法,从单个图像进行深度估计已获得了令人鼓舞的结果。在本文中,我们研究了彩色图像的扩展以及相应的深度回归深度图像,以提高语义分割的性能。此外,比较了将颜色通道与估计深度或地面真实深度通道组合在一起的用法。具体来说,我们的工作分为两个阶段。首先,我们采用卷积神经网络(CNN)的框架,通过结合全局深度网络和深度梯度网络进行深度估计。在基于这两个网络进行细化之后,可以以深度回归的方式估计深度图像图。其次,在用预测的深度图像增强彩色图像之后,进一步使用全卷积网络(FCN)来实现像素级语义标记。在实验中,我们分别使用两个流行的RGBD数据集SUNRGBD和NYUDv2分别进行了37类和40类语义分割。通过与地面真实深度图像进行比较,实验结果表明,在估计的深度图像上训练的网络在促进语义分割任务的准确性上可以达到可比的性能。 (C)2017 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号