首页> 外文期刊>Wissenschaftliche Arbeiten der Fachrichtung Geodasie und Geoinformatik der Leibniz Universitat Hannover >Probabilistic Pose Estimation and 3D Reconstruction of Vehicles from Stereo Images
【24h】

Probabilistic Pose Estimation and 3D Reconstruction of Vehicles from Stereo Images

机译:立体图像概率姿态估计与车辆的三维重建

获取原文
           

摘要

The pose estimation and reconstruction of 3D objects from images is one of the major problems that are addressed in computer vision and photogrammetry. The understanding of a 3D scene and the 3D reconstruction of specific objects are prerequisites for many highly relevant applications of computer vision such as mobile robotics and autonomous driving. To deal with the inverse problem of reconstructing 3D objects from their 2D projections, a common strategy is to incorporate prior object knowledge into the reconstruction approach by establishing a 3D model and aligning it to the 2D image plane. However, current approaches are limited due to inadequate shape priors and the insufficiency of the derived image observations for a reliable association and alignment with the 3D model. The goal of this thesis is to infer valuable observations from the images and to show how 3D object reconstruction can profit from a more sophisticated shape prior and from a combined incorporation of the different observation types. To achieve this goal, this thesis presents three major contributions for the particular task of 3D vehicle reconstruction from street-level stereo images. First, a subcategory-aware deformable vehicle model is introduced that makes use of a prediction of the vehicle type for a more appropriate regularisation of the vehicle shape. Second, a Convolutional Neural Network (CNN) is proposed which extracts observations from an image. In particular, the CNN is used to derive a prediction of the vehicle orientation and type, which are introduced as prior information for model fitting. Furthermore, the CNN extracts vehicle keypoints and wireframes, which are well-suited for model association and model fitting. Third, the task of pose estimation and reconstruction is addressed by a versatile probabilistic model. Suitable parametrisations and formulations of likelihood and prior terms are introduced for a joint consideration of the derived observations and prior information in the probabilistic objective function. As the objective function is non-convex and discontinuous, a proper customized strategy based on stochastic sampling is proposed for inference, yielding convincing results for the estimated poses and shapes of the vehicles. To evaluate the performance and to investigate the strengths and limitations of the proposed method, extensive experiments are conducted using two challenging real-world data sets: the publicly available KITTI benchmark and the ICSENS data set, which was created in the scope of this thesis. On both data sets, the benefit of the developed shape prior and of each of the individual components of the probabilistic model can be shown. The proposed method yields vehicle pose estimates with a median error of up to 27 cm for the position and up to 1.7° for the orientation on the data sets. A comparison to state-of-the-art methods for vehicle pose estimation shows that the proposed approach performs on par or better, confirming the suitability of the developed model and inference procedure.
机译:来自图像的3D对象的姿势估计和重建是计算机视觉和摄影测量中解决的主要问题之一。对3D场景的理解和特定对象的3D重建是计算机愿景的许多高度相关应用,例如移动机器人和自主驾驶的先决条件。为了从他们的2D投影中重建3D对象的逆问题,通过建立3D模型并将其对准到2D图像平面来将先前的对象知识结合到重建方法中。然而,由于形状前沿不充分的形状前沿和用于可靠关联和与3D模型对准的衍生图像观察的不足,电流方法受到限制。本文的目标是从图像中推断有价值的观察,并展示3D对象重建在不同观察类型的组合结合中可以从更复杂的形状中获利。为实现这一目标,本文为来自街道立体声图像的3D车辆重建的特定任务提供了三项主要贡献。首先,引入了子类别感知可变形车辆模型,其利用车辆类型的车辆类型的预测来实现车辆形状的更合适的正则化。其次,提出了一种卷积神经网络(CNN),其从图像中提取观察。特别地,CNN用于导出车辆取向和类型的预测,其被引入作为模型配件的先前信息。此外,CNN提取车辆关键点和线框,其非常适合于模型关联和模型配件。第三,通过多功能概率模型解决了姿势估计和重建的任务。介绍了适当的似然和前提条件和先前术语的配方,用于联合考虑潜在的观察结果和在概率目标函数中的先前信息。由于目标函数是非凸的并且不连续的,提出了一种基于随机取样的适当定制的策略进行推理,从而产生估计姿势和车辆形状的令人信服的结果。为了评估绩效并调查所提出的方法的优点和局限性,使用两个具有挑战性的现实世界数据集进行了广泛的实验:公开可用的基蒂基准和ICSENS数据集,这些数据集是在本论文的范围内创建的。在两个数据集上,可以示出在概率模型的每个单独组件之前和开发形状的益处。所提出的方法产生车辆姿势估计,位于位置最多27厘米的中值误差,并且在数据集上的方向上最高为1.7°。与车辆姿势估计的最先进方法的比较表明,所提出的方法执行PAR或更好,确认了开发模型和推理过程的适用性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号