Enhancing semantic image retrieval with limited labeled examples via deep learning

Haijiao Xu; Changqin Huang; Dianhui Wang

首页> 外文期刊>Knowledge-Based Systems >Enhancing semantic image retrieval with limited labeled examples via deep learning

【24h】

Enhancing semantic image retrieval with limited labeled examples via deep learning

机译：通过深度学习通过有限的标记示例增强语义图像检索

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

With the rapid growth of the Internet, a large number of multi-modal objects such as images and their social tags can easily be downloaded from the Web. The use of such objects can improve training process in the presence of few or limited number of labeled images provided. In order to leverage these unlabeled and labeled multi-modal Web objects for enhancing the performance of unimodal image retrieval, we propose a novel approach called Semi-supervised Multi-concept Retrieval to semantic image retrieval via Deep Learning (SMRDL) in this paper. Differing from conventional methods that use multiple and independent concepts in a semantic multi-concept query, our proposed approach regards multiple concepts as a holistic scene for multi-concept scene learning of unimodal retrieval. In particular, we first train a multi-modal Convolutional Neural Network (CNN) as a concept classifier for images and texts, and then use it to annotate unlabeled Web images. For each of unlabeled images, we then obtain its most relevant concept annotations by using a new strategy of annotation promotion. Finally, we employ a unimodal visual CNN to train a concept classifier in visual modality, which uses both unlabeled and labeled examples for concept learning of unimodal retrieval. The results of our comprehensive experiments on two datasets of MIR Flickr 2011 and NUS-WIDE have shown that our proposed approach outperforms several state-of-the-art methods.

机译：随着Internet的快速发展，可以轻松地从Web下载大量的多模式对象，例如图像及其社交标签。在提供的标记图像很少或数量有限的情况下，使用此类对象可以改善训练过程。为了利用这些未标记和标记的多模态Web对象来增强单模态图像检索的性能，我们在本文中提出了一种称为“半监督多概念检索”的新方法，用于通过深度学习（SMRDL）进行语义图像检索。与在语义多概念查询中使用多个独立概念的常规方法不同，我们提出的方法将多个概念视为用于单模式检索的多概念场景学习的整体场景。特别是，我们首先训练多模式卷积神经网络（CNN）作为图像和文本的概念分类器，然后使用它来标注未标记的Web图像。然后，对于每个未标记的图像，我们通过使用新的注释促进策略来获取其最相关的概念注释。最后，我们采用单峰视觉CNN训练视觉模态中的概念分类器，该模型使用未标记和已标记的示例进行单峰检索的概念学习。我们对MIR Flickr 2011和NUS-WIDE的两个数据集进行的综合实验结果表明，我们提出的方法优于几种最新方法。

著录项

来源
《Knowledge-Based Systems》 |2019年第1期|252-266|共15页
作者
Haijiao Xu; Changqin Huang; Dianhui Wang;
展开▼
作者单位

Department of Computer Science and Information Technology, La Trobe University;

School of Information Technology in Education, South China Normal University;

The State Key Laboratory of Synthetical Automation for Process Industries, Northeastern University;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Semantic image retrieval; Semi-supervised learning; Convolutional neural networks; Concept-based image retrieval;

机译：语义图像检索;半监督学习;卷积神经网络;基于概念的图像检索;

相似文献

外文文献
中文文献
专利

1. DSRPH: Deep semantic-aware ranking preserving hashing for efficient multi-label image retrieval [J] . Shen Yiming, Feng Yong, Fang Bin, Information Sciences: An International Journal . 2020,第1期

机译：DSRPH：深度语义感知排名保存散列有效的多标签图像检索
2. Image retrieval method based on deep learning semantic feature extraction and regularization softmax [J] . Qinghai Wu Multimedia Tools and Applications . 2020,第13a14期

机译：基于深度学习语义特征提取和正规化的图像检索方法Softmax
3. Large-scale semantic web image retrieval using bimodal deep learning techniques [J] . Changqin Huang, Haijiao Xu, Liang Xie, Information Sciences: An International Journal . 2018,第期

机译：使用双模深度学习技术检索大规模语义网络图像检索
4. Deep semantic ranking based hashing for multi-label image retrieval [C] . Fang Zhao, Yongzhen Huang, Liang Wang, IEEE Conference on Computer Vision and Pattern Recognition . 2015

机译：基于深度语义排名的多标签图像检索的散列
5. Learning Robust Visual-Semantic Retrieval Models with Limited Supervision [D] . Mithun, Niluthpol Chowdhury. 2019

机译：学习强大的视觉语义检索模型，监督有限
6. Deep-Learning-Based Semantic Labeling for 2D Mammography and Comparison of Complexity for Machine Learning Tasks [O] . Paul H. Yi, Abigail Lin, Jinchi Wei, 2019

机译：基于深度学习的2D乳腺摄影语义标记和机器学习任务的复杂度比较
7. MARRYING DEEP LEARNING AND DATA FUSION FOR ACCURATE SEMANTIC LABELING OF SENTINEL-2 IMAGES [O] . G. Fonteix, M. Swaine, M. Leras, 2021

机译：嫁给深度学习和数据融合，准确语义标记Sentinel-2图像

Enhancing semantic image retrieval with limited labeled examples via deep learning

摘要

著录项

相似文献

相关主题

期刊订阅