首页> 外文期刊>Knowledge-Based Systems >A decisive content based image retrieval approach for feature fusion in visual and textual images
【24h】

A decisive content based image retrieval approach for feature fusion in visual and textual images

机译:基于决定性内容的图像检索方法,用于视觉和文本图像中的特征融合

获取原文
获取原文并翻译 | 示例
           

摘要

Image content analysis plays a dynamic role in various computer vision applications. These contents can be either visual (i.e. color, shape, texture) or the textual (i.e. text appearing within images). Both the contents involve fundamental characteristics of an image and thus can be an enormous asset for any intelligent application. For content based image retrieval (CBIR) systems, most of the art methods are either annotated text based or the visual search based. Due to high demand of multitasking, there is a great need of a system that can combine visual as well as textual features. Consequently, this work proposes a decisive CBIR approach that combines visual and textual features to retrieve similar images. Firstly, the method classifies the query image as textual and non-textual. If any text appears within the image then the query image is classified as textual, and the text is detected and formed as Bag of Textual words. If the query image is classified as non-textual, the visual salient features are extracted and formed as Bag of Visual words. Next, the method fuses the visual and textual features, and top similar images are retrieved based on the fused feature vector. It supports three modes of retrieval: Image query, Keywords, and a combination of both. The experimental results on four datasets show the efficiency and accuracy of the proposed approach for visual and textual images. (C) 2019 Elsevier B.V. All rights reserved.
机译:图像内容分析在各种计算机视觉应用程序中发挥着动态作用。这些内容可以是视觉的(即颜色,形状,纹理),也可以是文本的(即图像中出现的文本)。这两个内容都涉及图像的基本特征,因此对于任何智能应用程序来说都是巨大的资产。对于基于内容的图像检索(CBIR)系统,大多数现有技术方法是基于注释文本或基于视觉搜索。由于对多任务的高需求,因此迫切需要一种可以将视觉和文本功能相结合的系统。因此,这项工作提出了一种决定性的CBIR方法,该方法结合了视觉和文本特征来检索相似的图像。首先,该方法将查询图像分为文本和非文本。如果图像中出现任何文本,则查询图像将被分类为文本,文本将被检测为文本袋。如果查询图像被分类为非文本,则将视觉显着特征提取并形成为“视觉袋”。接下来,该方法融合了视觉和文字特征,并基于融合的特征向量检索了顶部的相似图像。它支持三种检索模式:图像查询,关键字以及二者的组合。在四个数据集上的实验结果表明了所提出的视觉和文本图像方法的效率和准确性。 (C)2019 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号