首页> 外文会议>International Conference on Image and Signal Processing >A Comparative Study of Irregular Pyramid Matching in Bag-of-Bags of Words Model for Image Retrieval
【24h】

A Comparative Study of Irregular Pyramid Matching in Bag-of-Bags of Words Model for Image Retrieval

机译:图像检索袋袋袋模型中的不规则金字塔匹配的比较研究

获取原文

摘要

In this paper we assess three standard approaches to build irregular pyramid partitions for image retrieval in the bag-of-bags of words model that we recently proposed. These three approaches are: kernel k-means to optimize multilevel weighted graph cuts, Normalized Cuts and Graph Cuts, respectively. The bag-of-bags of words (BBoW) model, is an approach based on irregular pyramid partitions over the image. An image is first represented as a connected graph of local features on a regular grid of pixels. Irregular partitions (subgraphs) of the image are further built by using graph partitioning methods. Each subgraph in the partition is then represented by its own signature. The BBo W model with the aid of graph, extends the classical bag-of-words (BoW) model, by embedding color homogeneity and limited spatial information through irregular partitions of an image. Compared to existing methods for image retrieval, such as Spatial Pyramid Matching (SPM), the BBoW model does not assume that similar parts of a scene always appear at the same location in images of the same category. The extension of the proposed model to pyramid gives rise to a method we name irregular pyramid matching (IPM). The experiments on Caltech-101 benchmark demonstrate that applying kernel k-means to graph clustering process produces better retrieval results, as compared with other graph partitioning methods such as Graph Cuts and Normalized Cuts for BBoW. Moreover, this proposed method achieves comparable results and outperforms SPM in 19 object categories on the whole Caltech-101 dataset.
机译:在本文中,我们评估三个标准的方法来构建图像检索中袋的袋的话模型,我们最近提出的不规则金字塔分区。这三种方法是:核k-手段分别优化的多级加权图割,标准化切割和图割。袋的袋的字(BBoW)模型,是基于在所述图像金字塔不规则分区的方法。的图像被首先表示为局部特征的像素的规则网格的连通图。图像的不规则分区(子图)是通过使用图形分割方法的进一步构建的。分区中的每个子图,然后通过自己的签名表示。与图形的帮助下BBO W型,扩展了经典袋的字(BOW)模型,通过图像的不规则分区嵌入颜色均匀性和有限的空间信息。相比于图像检索现有的方法,如空间金字塔匹配(SPM),该BBoW模型并不假定一个场景中相同的部件总是出现在同一类别的图像相同的位置。该模型以金字塔的延长引起我们的名字不规则的金字塔匹配(IPM)的方法。上加州理工学院-101基准实验表明,应用内核k-均值到图聚类过程产生更好的检索结果,与其他图分割方法,例如图割和BBoW标准化切割相比较。此外,该提议的方法实现类似的结果,优于SPM在对整个加州理工学院-101数据集19个的对象类别。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号