...
首页> 外文期刊>Knowledge-Based Systems >Enhancing semantic image retrieval with limited labeled examples via deep learning
【24h】

Enhancing semantic image retrieval with limited labeled examples via deep learning

机译:通过深度学习通过有限的标记示例增强语义图像检索

获取原文
获取原文并翻译 | 示例
           

摘要

With the rapid growth of the Internet, a large number of multi-modal objects such as images and their social tags can easily be downloaded from the Web. The use of such objects can improve training process in the presence of few or limited number of labeled images provided. In order to leverage these unlabeled and labeled multi-modal Web objects for enhancing the performance of unimodal image retrieval, we propose a novel approach called Semi-supervised Multi-concept Retrieval to semantic image retrieval via Deep Learning (SMRDL) in this paper. Differing from conventional methods that use multiple and independent concepts in a semantic multi-concept query, our proposed approach regards multiple concepts as a holistic scene for multi-concept scene learning of unimodal retrieval. In particular, we first train a multi-modal Convolutional Neural Network (CNN) as a concept classifier for images and texts, and then use it to annotate unlabeled Web images. For each of unlabeled images, we then obtain its most relevant concept annotations by using a new strategy of annotation promotion. Finally, we employ a unimodal visual CNN to train a concept classifier in visual modality, which uses both unlabeled and labeled examples for concept learning of unimodal retrieval. The results of our comprehensive experiments on two datasets of MIR Flickr 2011 and NUS-WIDE have shown that our proposed approach outperforms several state-of-the-art methods.
机译:随着Internet的快速发展,可以轻松地从Web下载大量的多模式对象,例如图像及其社交标签。在提供的标记图像很少或数量有限的情况下,使用此类对象可以改善训练过程。为了利用这些未标记和标记的多模态Web对象来增强单模态图像检索的性能,我们在本文中提出了一种称为“半监督多概念检索”的新方法,用于通过深度学习(SMRDL)进行语义图像检索。与在语义多概念查询中使用多个独立概念的常规方法不同,我们提出的方法将多个概念视为用于单模式检索的多概念场景学习的整体场景。特别是,我们首先训练多模式卷积神经网络(CNN)作为图像和文本的概念分类器,然后使用它来标注未标记的Web图像。然后,对于每个未标记的图像,我们通过使用新的注释促进策略来获取其最相关的概念注释。最后,我们采用单峰视觉CNN训练视觉模态中的概念分类器,该模型使用未标记和已标记的示例进行单峰检索的概念学习。我们对MIR Flickr 2011和NUS-WIDE的两个数据集进行的综合实验结果表明,我们提出的方法优于几种最新方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号