...
首页> 外文期刊>IEEE Transactions on Image Processing >Effective Multi-Query Expansions: Collaborative Deep Networks for Robust Landmark Retrieval
【24h】

Effective Multi-Query Expansions: Collaborative Deep Networks for Robust Landmark Retrieval

机译:有效的多查询扩展:协作深度网络可实现可靠的地标检索

获取原文
获取原文并翻译 | 示例
           

摘要

Given a query photo issued by a user (q-user), the landmark retrieval is to return a set of photos with their landmarks similar to those of the query, while the existing studies on the landmark retrieval focus on exploiting geometries of landmarks for similarity matches between candidate photos and a query photo. We observe that the same landmarks provided by different users over social media community may convey different geometry information depending on the viewpoints and/or angles, and may, subsequently, yield very different results. In fact, dealing with the landmarks with low quality shapes caused by the photography of q-users is often nontrivial and has seldom been studied. In this paper, we propose a novel framework, namely, multi-query expansions, to retrieve semantically robust landmarks by two steps. First, we identify the top- k photos regarding the latent topics of a query landmark to construct multi-query set so as to remedy its possible low quality shape. For this purpose, we significantly extend the techniques of Latent Dirichlet Allocation. Then, motivated by the typical collaborative filtering methods, we propose to learn a collaborative deep networks-based semantically, nonlinear, and high-level features over the latent factor for landmark photo as the training set, which is formed by matrix factorization over collaborative user-photo matrix regarding the multi-query set. The learned deep network is further applied to generate the features for all the other photos, meanwhile resulting into a compact multi-query set within such space. Then, the final ranking scores are calculated over the high-level feature space between the multi-query set and all other photos, which are ranked to serve as the final ranking list of landmark retrieval. Extensive experiments are conducted on real-world social media data with both landmark photos together with their user information to show the superior performance over the existing methods, especially our recently proposed multi-query based mid-level pattern representation method [1].
机译:给定一个由用户(q-user)发出的查询照片,地标检索将返回其地标与查询相似的一组照片,而有关地标检索的现有研究集中于利用地标的几何形状进行相似性研究匹配候选照片和查询照片。我们观察到,由不同用户在社交媒体社区上提供的相同地标可能会根据视点和/或角度传达不同的几何信息,并且随后可能会产生非常不同的结果。实际上,处理由q用户拍摄而导致的低质量形状的地标通常是不容易的,并且很少进行研究。在本文中,我们提出了一个新颖的框架,即多查询扩展,可以通过两个步骤来检索语义上稳定的地标。首先,我们确定与查询地标的潜在主题相关的前k张照片,以构造多查询集,以纠正其可能的低质量形状。为此,我们大大扩展了潜在Dirichlet分配技术。然后,在典型的协作过滤方法的启发下,我们建议学习基于协作深度矩阵的语义,非线性和高级特征,并将其作为地标照片的潜在因素作为训练集,这是通过对协作用户进行矩阵分解而形成的-有关多查询集的照片矩阵。进一步将学习到的深度网络应用于生成所有其他照片的特征,同时在此空间内生成紧凑的多查询集。然后,在多查询集和所有其他照片之间的高级特征空间上计算最终排名分数,将其排名以用作地标检索的最终排名列表。在现实世界的社交媒体数据上进行了广泛的实验,将两张地标照片及其用户信息一起显示出优于现有方法的性能,尤其是我们最近提出的基于多查询的中级模式表示方法[1]。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号