首页> 外文会议> >Total Recall: Automatic Query Expansion with a Generative Feature Model for Object Retrieval
【24h】

Total Recall: Automatic Query Expansion with a Generative Feature Model for Object Retrieval

机译:全面召回:具有生成对象模型的生成特征模型的自动查询扩展

获取原文

摘要

Given a query image of an object, our objective is to retrieve all instances of that object in a large (1M+) image database. We adopt the bag-of-visual-words architecture which has proven successful in achieving high precision at low recall. Unfortunately, feature detection and quantization are noisy processes and this can result in variation in the particular visual words that appear in different images of the same object, leading to missed results. In the text retrieval literature a standard method for improving performance is query expansion. A number of the highly ranked documents from the original query are reissued as a new query. In this way, additional relevant terms can be added to the query. This is a form of blind relevance feedback and it can fail if `outlier'' (false positive) documents are included in the reissued query. In this paper we bring query expansion into the visual domain via two novel contributions. Firstly, strong spatial constraints between the query image and each result allow us to accurately verify each return, suppressing the false positives which typically ruin text-based query expansion. Secondly, the verified images can be used to learn a latent feature model to enable the controlled construction of expanded queries. We illustrate these ideas on the 5000 annotated image Oxford building database together with more than 1M Flickr images. We show that the precision is substantially boosted, achieving total recall in many cases.
机译:给定对象的查询图像,我们的目标是在大型(1M +)图像数据库中检索该对象的所有实例。我们采用了可视化词袋体系结构,该体系结构已被证明可以在低召回率下实现高精度。不幸的是,特征检测和量化是嘈杂的过程,这可能导致出现在同一对象的不同图像中的特定视觉单词发生变化,从而导致结果丢失。在文本检索文献中,提高性能的一种标准方法是查询扩展。来自原始查询的许多排名较高的文档将重新发布为新查询。这样,可以将其他相关术语添加到查询中。这是一种盲目的相关性反馈,如果重新发布的查询中包含“异常”(误报)文档,则可能会失败。在本文中,我们通过两个新颖的贡献将查询扩展带入可视域。首先,查询图像和每个结果之间的强烈空间约束使我们能够准确地验证每个返回值,从而抑制了误报(通常会破坏基于文本的查询扩展)。其次,经过验证的图像可用于学习潜在特征模型,以实现扩展查询的受控构造。我们在5000幅带注释的牛津建筑数据库图像上说明了这些想法,并提供了超过1M张Flickr图像。我们表明,精度大大提高,在许多情况下都实现了完全召回。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号