Modeling spatial layout for scene image understanding via a novel multiscale sum-product network

Yuan Zehuan; Wang Hao; Wang Limin; Lu Tong; Palaiahnakote Shivakumara; Tan Chew Lim

首页> 外文期刊>Expert Systems with Application >Modeling spatial layout for scene image understanding via a novel multiscale sum-product network

【24h】

Modeling spatial layout for scene image understanding via a novel multiscale sum-product network

机译：通过新颖的多尺度求和积网络对空间布局进行建模以了解场景图像

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Semantic image segmentation is challenging due to the large intra-class variations and the complex spatial layouts inside natural scenes. This paper investigates this problem by designing a new deep architecture, called multiscale sum-product network (MSPN), which utilizes multiscale unary potentials as the inputs and models the spatial layouts of image content in a hierarchical manner. That is, the proposed MSPN models the joint distribution of multiscale unary potentials and object classes instead of single unary potentials in popular settings. Besides, MSPN characterizes scene spatial layouts in a fine-to-coarse manner to enforce the consistency in labeling. Multiscale unary potentials at different scales can thus help overcome semantic ambiguities caused by only evaluating single local regions, while long-range spatial correlations can further refine image labeling. In addition, higher orders are able to pose the constraints among labels, By this way, multi-scale unary potentials, long-range spatial correlations, higher-order priors are well modeled under the uniform framework in MSPN. We conduct experiments on two challenging benchmarks consisting of the MSRC-21 dataset and the SIFT FLOW dataset. The results demonstrate the superior performance of our method comparing with the previous graphical models for understanding scene images. (C) 2016 Elsevier Ltd. All rights reserved.

机译：由于类内差异较大以及自然场景内部的空间布局复杂，因此语义图像分割具有挑战性。本文通过设计一种称为多尺度和乘积网络（MSPN）的新的深层体系结构来研究此问题，该体系结构利用多尺度一元电势作为输入并以分层方式对图像内容的空间布局进行建模。也就是说，提出的MSPN可以模拟多尺度一元势和对象类的联合分布，而不是在流行环境中的单元势。此外，MSPN以精细到粗略的方式表征场景空间布局，以增强标签的一致性。因此，不同尺度的多尺度一元电势可以帮助克服仅评估单个局部区域而引起的语义歧义，而远程空间相关可以进一步完善图像标记。此外，高阶能够在标签之间施加约束。这样，在MSPN的统一框架下就可以很好地建模多尺度一元势，远程空间相关性，高阶先验。我们在由MSRC-21数据集和SIFT FLOW数据集组成的两个具有挑战性的基准上进行实验。结果表明，与用于理解场景图像的先前图形模型相比，我们的方法具有优越的性能。（C）2016 Elsevier Ltd.保留所有权利。

著录项

来源
《Expert Systems with Application》 |2016年第11期|231-240|共10页
作者
Yuan Zehuan; Wang Hao; Wang Limin; Lu Tong; Palaiahnakote Shivakumara; Tan Chew Lim;
展开▼
作者单位

Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing, Jiangsu, Peoples R China;

Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing, Jiangsu, Peoples R China;

Swiss Fed Inst Technol, Comp Vis Lab, Zurich, Switzerland;

Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing, Jiangsu, Peoples R China;

Univ Malaya, Fac Comp Sci & Informat Technol, Kuala Lumpur, Malaysia;

Natl Univ Singapore, Sch Comp, Singapore, Singapore;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Scene image understanding; Spatial layout; Multiscale unary potentials; Multiscale sum-product network; MSPN;

机译：场景图像理解;空间布局;多尺度一元势;多尺度和积网络;MSPN;

相似文献

外文文献
中文文献
专利

1. Automatic Ship Detection in Remote Sensing Images from Google Earth of Complex Scenes Based on Multiscale Rotation Dense Feature Pyramid Networks [J] . Xue Yang, Hao Sun, Kun Fu, Remote Sensing . 2018,第1期

机译：基于多尺度旋转密集特征金字塔网络的复杂场景谷歌地球遥感图像自动检测
2. Modeling the spatial layout of images beyond spatial pyramids [J] . Jorge Sanchez, Florent Perronnin, Teofilo de Campos Pattern recognition letters . 2012,第16期

机译：对超出空间金字塔的图像的空间布局进行建模
3. Weakly Supervised Learning with Deep Convolutional Neural Networks for Semantic Segmentation: Understanding Semantic Layout of Images with Minimum Human Supervision [J] . Seunghoon Hong, Suha Kwak, Bohyung Han IEEE Signal Processing Magazine . 2017,第6期

机译：使用深度卷积神经网络进行语义监督的弱监督学习：以最少的人工监督了解图像的语义布局
4. Layout and Context Understanding for Image Synthesis with Scene Graphs [C] . Arces Talavera, Daniel Stanley Tan, Arnulfo Azcarraga, IEEE International Conference on Image Processing . 2019

机译：使用场景图进行图像合成的布局和上下文理解
5. Seeing the world behind the image: Spatial layout for three-dimensional scene understanding [D] . Hoiem, Derek 2007

机译：看到图像背后的世界：用于三维场景理解的空间布局
6. Image-based multiscale modeling predicts tissue-level and network-level fiber reorganization in stretched cell-compacted collagen gels [O] . Edward A. Sander, Triantafyllos Stylianopoulos, Robert T. Tranquillo, 2019

机译：基于图像的多尺度建模可预测拉伸细胞压紧的胶原蛋白凝胶中的组织级和网络级纤维重组
7. 3D spatial layout and geometric constraints for scene understanding [O] . Hedau Varsha 2011

机译：用于场景理解的3D空间布局和几何约束
8. Improved canopy reflectance modeling and scene inference through improved understanding of scene pattern [R] . Franklin, Janet, Simonett, David 1988

机译：通过改进对场景模式的理解，改进了冠层反射建模和场景推理

Modeling spatial layout for scene image understanding via a novel multiscale sum-product network

摘要

著录项

相似文献

相关主题

期刊订阅