A Sim2Real Deep Learning Approach for the Transformation of Images from Multiple Vehicle-Mounted Cameras to a Semantically Segmented Image in Bird’s Eye View

机译：一种SIM2REAL深度学习方法，用于在鸟瞰图中从多车载摄像机从多载载体摄像机转换图像的深度学习方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Accurate environment perception is essential for automated driving. When using monocular cameras, the distance estimation of elements in the environment poses a major challenge. Distances can be more easily estimated when the camera perspective is transformed to a bird’s eye view (BEV). For flat surfaces, Inverse Perspective Mapping (IPM) can accurately transform images to a BEV. Three-dimensional objects such as vehicles and vulnerable road users are distorted by this transformation making it difficult to estimate their position relative to the sensor. This paper describes a methodology to obtain a corrected 360° BEV image given images from multiple vehicle-mounted cameras. The corrected BEV image is segmented into semantic classes and includes a prediction of occluded areas. The neural network approach does not rely on manually labeled data, but is trained on a synthetic dataset in such a way that it generalizes well to real-world data. By using semantically segmented images as input, we reduce the reality gap between simulated and real-world data and are able to show that our method can be successfully applied in the real world. Extensive experiments conducted on the synthetic data demonstrate the superiority of our approach compared to IPM. Source code and datasets are available at https://github.com/ika-rwth-aachen/Cam2BEV.

机译：准确的环境感知对于自动驾驶至关重要。使用单眼摄像机时，环境中元素的距离估计会产生重大挑战。当相机透视转换为鸟瞰图（BEV）时，可以更容易地估计距离。对于平面，反向透视映射（IPM）可以将图像精确地变换为BEV。诸如车辆和易受攻击的道路用户之类的三维物体因该转变而扭曲，使得难以估计其相对于传感器的位置。本文介绍了一种方法，以获得来自多车载相机的校正的360°BEV图像。将校正的BEV图像分段为语义类，并且包括对遮挡区域的预测。神经网络方法不依赖于手动标记的数据，但是在合成数据集上培训，以这样的方式概括到现实世界数据。通过使用语义分段图像作为输入，我们降低了模拟和现实世界之间的现实差距，并且能够表明我们的方法可以在现实世界中成功应用。与IPM相比，对合成数据进行的广泛实验表明了我们的方法的优越性。源代码和数据集是在https://github.com/ika-rwth-aachen/cam2bev中获得的。

著录项

来源
《IEEE International Conference on Intelligent Transportation Systems》|2020年|1-7|共7页
会议地点
作者
Lennart Reiher; Bastian Lampe; Lutz Eckstein;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Cameras; Image segmentation; Neural networks; Task analysis; Semantics; Roads; Vehicle dynamics;

机译：摄像机;图像分割;神经网络;任务分析;语义;道路;车辆动态;

相似文献

外文文献
中文文献
专利

1. Bird's-eye view generation at arbitrary viewpoint from multi-camera images [J] . Makoto Sekitoh, Koichi Toyota, Toshiaki Fujii, 電子情報通信学会技術研究報告. パターン認識·メディア理解. Pattern Recognition and Media Understanding . 2000,第633期

机译：从多摄像机图像中以任意视点生成鸟瞰图
2. Bird's-eye view generation at arbitrary viewpoint from multi-camera images [J] . Makoto Sekitoh, Koichi Toyota, Toshiaki Fujii, 電子情報通信学会技術研究報告. パターン認識·メディア理解. Pattern Recognition and Media Understanding . 2000,第633期

机译：鸟瞰图在多相机图像中的任意观点时代
3. Deep learning for automated river-level monitoring through river-camera images: an approach based on water segmentation and transfer learning [J] . Vandaele Remy, Dance Sarah L., Ojha Varun Hydrology and Earth System Sciences Discussions . 2021,第8期

机译：通过河流相机图像深度学习自动化河流监控：一种基于水分割和转移学习的方法
4. Spatio-temporal bird's-eye view images using multiple fish-eye cameras [C] . Sato Takaaki, Moro Alessandro, Sugahara Atsushi, IEEE/SICE International Symposium on System Integration . 2013

机译：使用多个鱼眼镜头的时空鸟瞰图像
5. Automatic Information Extraction from Camera-Trap Images Using Deep Learning [D] . Norouzzadeh, Mohammad Sadegh. 2019

机译：使用深度学习的摄像机陷阱图像自动提取
6. A bird’s-eye view of deep learning in bioimage analysis [O] . Erik Meijering 2020

机译：生物贴图分析深度学习的鸟瞰图
7. A Sim2Real Deep Learning Approach for the Transformation of Images from Multiple Vehicle-Mounted Cameras to a Semantically Segmented Image in Bird’s Eye View [O] . Lennart Reiher, Bastian Lampe, Lutz Eckstein 2020

机译：一种SIM2REAL深度学习方法，用于在鸟瞰图中从多车载摄像机从多载载体摄像机转换图像的深度学习方法

A Sim2Real Deep Learning Approach for the Transformation of Images from Multiple Vehicle-Mounted Cameras to a Semantically Segmented Image in Bird’s Eye View

摘要

著录项

相似文献

相关主题

期刊订阅