VQD: Visual Query Detection in Natural Scenes

机译：VQD：自然场景中的视觉查询检测

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We propose Visual Query Detection (VQD), a new visual grounding task. In VQD, a system is guided by natural language to localize a variable number of objects in an image. VQD is related to visual referring expression recognition, where the task is to localize only one object. We describe the first dataset for VQD and we propose baseline algorithms that demonstrate the difficulty of the task compared to referring expression recognition.

机译：我们提出了视觉查询检测（VQD），这是一个新的视觉接地任务。在VQD中，一个系统由自然语言引导，以本地化图像中的可变数量的对象。 VQD与可视引用的表达式识别有关，其中任务只能本地化一个对象。我们描述了VQD的第一个DataSet，我们提出了与引用表达式识别相比，展示了任务难度的基线算法。

著录项

来源
《Conference on the North American Chapter of the Association for Computational Linguistics: Human Language Technologies》|2019年|1955-1961|共7页
会议地点
作者
Manoj Acharya; Karan Jariwala; Christopher Kanan;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Visual Saliency Detection Using Group Lasso Regularization in Videos of Natural Scenes [J] . Souly Nasim, Shah Mubarak International Journal of Computer Vision . 2016,第1期

机译：在自然场景视频中使用组套索正则化进行视觉显着性检测
2. Visual cue diagnosticity for boundary detection in natural scenes: A computational study [J] . David Alex M??ly, Junkyung Kim, Mason McGill, Journal of vision . 2014,第10期

机译：自然场景边界检测的视觉提示诊断：一项计算研究
3. Face detection differs from categorization: Evidence from visual search in natural scenes [J] . Markus Bindemann, Michael B. Lewis Psychonomic bulletin & review . 2013,第6期

机译：人脸检测不同于分类：自然场景中视觉搜索的证据
4. VQD: Visual Query Detection in Natural Scenes [C] . Manoj Acharya, Karan Jariwala, Christopher Kanan Conference on the North American Chapter of the Association for Computational Linguistics: Human Language Technologies . 2019

机译：VQD：自然场景中的视觉查询检测
5. The Role of Visual Features in the Affective Categorization of Briefly Presented Naturalistic Scenes [D] . Rhodes, L. Jack. 2019

机译：视觉特征在简单呈现自然主义场景的情感分类中的作用
6. Natural scene statistics account for the representation of scene categories in human visual cortex [O] . Dustin E. Stansbury, Thomas Naselaris, Jack L. Gallant -1

机译：自然场景统计数据说明了人类视觉皮层中场景类别的表示方式
7. A Hierarchical Visual Saliency Model for Character Detection in Natural Scenes [O] . Renwu Gao, Faisal Shafait, Seiichi Uchida, 2015

机译：一种用于自然场景中字符检测的分层视觉显着模型
8. Natural Language Query System Design for Interactive Information Storage and Retrieval Systems. Presentation Visuals. Final Report, July 1, 1985-December 31, 1987 [R] . Dominick, W. D., Liu, I. 1985

机译：交互式信息存储与检索系统的自然语言查询系统设计。演示视觉。最终报告，1985年7月1日至1987年12月31日

VQD: Visual Query Detection in Natural Scenes

摘要

著录项

相似文献

相关主题

期刊订阅