Conference: International Conference on Neural Information Processing

REXUP: I REason, I EXtract, I UPdate with Structured Compositional Reasoning for Visual Question Answering

Abstract

Visual Question Answering (VQA) is a challenging multi-modal task that requires not only a semantic understanding of images and questions, but also a sound perception of the step-by-step reasoning process that leads to the correct answer. So far, most successful attempts in VQA have focused on only one aspect: either the interaction between the visual pixel features of images and the word features of questions, or the reasoning process of answering a question about an image containing simple objects. In this paper, we propose a deep reasoning VQA model (REXUP: REason, EXtract, and UPdate) with explicit visual structure-aware textual information, which works well in capturing the step-by-step reasoning process and detecting complex object relationships in photo-realistic images. REXUP consists of two branches, image object-oriented and scene graph-oriented, which jointly work with the super-diagonal fusion compositional attention networks. We evaluate REXUP on the benchmark GQA dataset and conduct extensive ablation studies to explore the reasons behind REXUP's effectiveness. Our best model significantly outperforms the previous state of the art, delivering 92.7% on the validation set and 73.1% on the test-dev set.
