首页> 外文学位 >On graph-structured discrete labeling problems in computer vision: Learning, inference and applications.

【24h】

On graph-structured discrete labeling problems in computer vision: Learning, inference and applications.

机译：关于计算机视觉中的图结构离散标注问题：学习，推理和应用。

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

A number of problems in computer vision (e.g., image segmentation, gender classification of faces, etc) can be formulated as graph-structured discrete labelling problems, where the goal is to predict labels (e.g. foreground/background, male/female) for a set of variables (e.g. pixels, faces in an image, etc) that have some known underlying structure (e.g., neighbouring pixels in an image often have related labels). This task of inferring optimal labels of structured variables is typically posed as the minimization of a discrete energy function over a graph, and is NP-hard for general graphs.In the first part, we present algorithms for learning the parameters of these energy functions from fully-labelled and partially-labelled training data.In the second part, we present a new approximate inference algorithm called Outer-Planar Decomposition (OPD). OPD decomposes the given intractable energy-minimization problem over a graph into tractable subproblems over outerplanar subgraphs and then employs message passing over these subgraphs to get an approximate global solution for the original graph. OPD outperforms current state-of-art inference methods on bard synthetic problems and is competitive on real computer-vision applications.Finally, in the third part, we demonstrate our work and apply this structured prediction paradigm to interactive co-segmentation of groups of related images and interactive 3D reconstruction of objects and scenes.This thesis is composed of three parts.

机译：可以将计算机视觉中的许多问题（例如，图像分割，面部性别分类等）表述为图形结构的离散标签问题，其中目标是预测对象的标签（例如，前景/背景，男性/女性）。具有一些已知基础结构（例如，图像中的相邻像素通常具有相关标签）的一组变量（例如，像素，图像中的面部等）。推断结构变量的最佳标签的任务通常是使图上的离散能量函数最小化，并且对于一般图来说是NP-难的。在第一部分中，我们提出了用于从中学习这些能量函数的参数的算法在第二部分中，我们提出了一种新的近似推理算法，称为外平面分解（OPD）。 OPD将图上的给定的棘手的能量最小化问题分解为外平面子图上的可处理子问题，然后使用消息传递这些子图以获得原始图的近似全局解。 OPD在盲合成问题上的性能优于当前的最新推理方法，并且在实际的计算机视觉应用程序中具有竞争力。最后，在第三部分中，我们演示了我们的工作并将此结构化的预测范式应用于相关组的交互式共细分图像和对象与场景的交互式3D重建。本文由三个部分组成。

著录项

作者
Batra, Dhruv.;
展开▼
作者单位

Carnegie Mellon University.;

展开▼
授予单位 Carnegie Mellon University.;
学科 Artificial Intelligence.Computer Science.
学位 Ph.D.
年度 2010
页码 105 p.
总页数 105
原文格式 PDF
正文语种 eng
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Graphical Model MAP Inference with Continuous Label Space in Computer Vision [J] . Fortschritt-Berichte VDI . 2018,第860期

机译：在计算机视觉中具有连续标签空间的图形模型MAP推理
2. Inference and Learning of Graphical Models: Theory and Applications in Computer Vision and Image Analysis [J] . Chaohui Wang, Nikos Komodakis, Hiroshi Ishikawa, Computer vision and image understanding . 2016,第Feba期

机译：图形模型的推理和学习：计算机视觉和图像分析的理论和应用
3. Special Issue on Large-Scale Computer Vision: Geometry, Inference, and Learning [J] . Roberto Cipolla, Carlo Colombo, Alberto Del Bimbo International Journal of Computer Vision . 2014,第3期

机译：大型计算机视觉特刊：几何，推理和学习
4. Towards Efficient and Exact MAP-Inference for Large Scale Discrete Computer Vision Problems via Combinatorial Optimization [C] . Kappes Jorg Hendrik, Speth Markus, Reinelt Gerhard, IEEE Conference on Computer Vision and Pattern Recognition . 2013

机译：通过组合优化实现大规模离散计算机视觉问题的高效且精确的MAP推理
5. Spectral graph matching, learning, and inference for computer vision. [D] . Leordeanu, Marius Dan. 2010

机译：光谱图匹配，学习和计算机视觉推断。
6. CGBayesNets: Conditional Gaussian Bayesian Network Learning and Inference with Mixed Discrete and Continuous Data [O] . Michael J. McGeachie, Hsun-Hsien Chang, Scott T. Weiss 2014

机译：CGBayesNets：带混合离散和连续数据的条件高斯贝叶斯网络学习和推理
7. Markov Random Field Modeling, Inference Learning in Computer Vision Image Understanding: A Survey [O] . Wang Chaohui, Komodakis Nikos, Paragios Nikos 2013

机译：计算机视觉和图像理解中的马尔可夫随机场建模，推理和学习：一项调查
8. Study of Relaxation Labelling Methods and Their Role in Computer Vision [R] . Desimone, R. 1983

机译：松弛标记方法及其在计算机视觉中的作用研究

On graph-structured discrete labeling problems in computer vision: Learning, inference and applications.

摘要

著录项

相似文献

相关主题

期刊订阅