首页> 外文会议>IEEE International Conference on Intelligent Transportation Systems >A Sim2Real Deep Learning Approach for the Transformation of Images from Multiple Vehicle-Mounted Cameras to a Semantically Segmented Image in Bird’s Eye View
【24h】

A Sim2Real Deep Learning Approach for the Transformation of Images from Multiple Vehicle-Mounted Cameras to a Semantically Segmented Image in Bird’s Eye View

机译:一种SIM2REAL深度学习方法,用于在鸟瞰图中从多车载摄像机从多载载体摄像机转换图像的深度学习方法

获取原文

摘要

Accurate environment perception is essential for automated driving. When using monocular cameras, the distance estimation of elements in the environment poses a major challenge. Distances can be more easily estimated when the camera perspective is transformed to a bird’s eye view (BEV). For flat surfaces, Inverse Perspective Mapping (IPM) can accurately transform images to a BEV. Three-dimensional objects such as vehicles and vulnerable road users are distorted by this transformation making it difficult to estimate their position relative to the sensor. This paper describes a methodology to obtain a corrected 360° BEV image given images from multiple vehicle-mounted cameras. The corrected BEV image is segmented into semantic classes and includes a prediction of occluded areas. The neural network approach does not rely on manually labeled data, but is trained on a synthetic dataset in such a way that it generalizes well to real-world data. By using semantically segmented images as input, we reduce the reality gap between simulated and real-world data and are able to show that our method can be successfully applied in the real world. Extensive experiments conducted on the synthetic data demonstrate the superiority of our approach compared to IPM. Source code and datasets are available at https://github.com/ika-rwth-aachen/Cam2BEV.
机译:准确的环境感知对于自动驾驶至关重要。使用单眼摄像机时,环境中元素的距离估计会产生重大挑战。当相机透视转换为鸟瞰图(BEV)时,可以更容易地估计距离。对于平面,反向透视映射(IPM)可以将图像精确地变换为BEV。诸如车辆和易受攻击的道路用户之类的三维物体因该转变而扭曲,使得难以估计其相对于传感器的位置。本文介绍了一种方法,以获得来自多车载相机的校正的360°BEV图像。将校正的BEV图像分段为语义类,并且包括对遮挡区域的预测。神经网络方法不依赖于手动标记的数据,但是在合成数据集上培训,以这样的方式概括到现实世界数据。通过使用语义分段图像作为输入,我们降低了模拟和现实世界之间的现实差距,并且能够表明我们的方法可以在现实世界中成功应用。与IPM相比,对合成数据进行的广泛实验表明了我们的方法的优越性。源代码和数据集是在https://github.com/ika-rwth-aachen/cam2bev中获得的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号