
Manhattan scene understanding using monocular, stereo, and 3D features



Abstract

This paper addresses scene understanding in the context of a moving camera, integrating semantic reasoning ideas from monocular vision with 3D information available through structure-from-motion. We combine geometric and photometric cues in a Bayesian framework, building on recent successes leveraging the indoor Manhattan assumption in monocular vision. We focus on indoor environments and show how to extract key boundaries while ignoring clutter and decorations. To achieve this we present a graphical model that relates photometric cues learned from labeled data, stereo photo-consistency across multiple views, and depth cues derived from structure-from-motion point clouds. We show how to solve MAP inference using dynamic programming, allowing exact, global inference in ∼100 ms (in addition to feature computation of under one second) without using specialized hardware. Experiments show our system outperforming the state of the art.
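To make the dynamic-programming MAP step concrete, the sketch below shows a minimal Viterbi-style chain DP over per-column labels, where each column's unary cost stands in for the combined photometric, stereo photo-consistency, and point-cloud cues and a pairwise cost penalizes label changes between adjacent columns. This is only an illustration of exact DP inference on a chain; the names dp_map_labeling, unary_cost, and transition_cost are hypothetical, and the paper's actual graphical model over indoor Manhattan structures is richer than this toy example.

```python
import numpy as np

def dp_map_labeling(unary_cost, transition_cost):
    """Exact MAP labeling of a chain by dynamic programming (Viterbi).

    unary_cost:      (n_cols, n_labels) per-column costs, e.g. combined
                     negative log-likelihoods from image and 3D cues
                     (how such cues are learned is not reproduced here).
    transition_cost: (n_labels, n_labels) cost of switching labels
                     between adjacent columns.
    Returns the minimum-cost label sequence as an int array.
    """
    n_cols, n_labels = unary_cost.shape
    cost = unary_cost[0].copy()                 # best cost ending in each label
    back = np.zeros((n_cols, n_labels), dtype=int)

    for t in range(1, n_cols):
        # total[i, j]: cost of being in label i at column t-1, then moving to j
        total = cost[:, None] + transition_cost
        back[t] = np.argmin(total, axis=0)
        cost = total[back[t], np.arange(n_labels)] + unary_cost[t]

    # Backtrack the optimal labeling
    labels = np.empty(n_cols, dtype=int)
    labels[-1] = int(np.argmin(cost))
    for t in range(n_cols - 1, 0, -1):
        labels[t - 1] = back[t, labels[t]]
    return labels

# Toy usage: 5 columns, 3 candidate labels (e.g. wall orientations)
rng = np.random.default_rng(0)
unary = rng.random((5, 3))
pairwise = 0.5 * (1 - np.eye(3))   # small penalty for switching labels
print(dp_map_labeling(unary, pairwise))
```

Because the chain structure admits exact inference in O(n_cols × n_labels²), this style of DP is what makes global optimization in roughly 100 ms plausible on standard hardware.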

