...
首页> 外文期刊>Neurocomputing >Selective residual learning for Visual Question Answering
【24h】

Selective residual learning for Visual Question Answering

机译:用于视觉问题的选择性剩余学习

获取原文
获取原文并翻译 | 示例
           

摘要

Visual Question Answering (VQA) aims to reason an answer, given a textual question and image pair. VQA methods are required to learn the relationship between image region features. These methods have the limitation of inefficient learning that can produce a performance drop. It is because current intra-relationship methods are trying to learn all the intra-relationships, regardless of their importance. In this paper, a novel self-attention based VQA module named Selective Residual learning (SelRes) is proposed. SelRes processes the residual learning selectively in self-attention networks. It measures the importance of the input vectors by the attention map and limits residual learning, except in the selected regions which related to the correct answer. Selective masking is also proposed, which can ensure that the selection in SelRes is preserved in the multi-stack structure of the VQA network. Our full model achieves new state-of-the-art performances on both from-scratch and fine-tuning models. (C) 2020 Elsevier B.V. All rights reserved.
机译:视觉问题应答(VQA)旨在给出一个答案,给定文本问题和图像对。需要VQA方法来学习图像区域特征之间的关系。这些方法限制了可能产生性能下降的低效学习。这是因为当前的内部关系方法试图了解所有内部关系,无论他们的重要性如何。本文提出了一种名为选择性残差学习(SELRES)的新型自我关注的VQA模块。 Selres在自我关注网络中选择性地处理残差学习。除了与正确答案相关的区域之外,它测量输入向量的重要性并限制残余学习。还提出了选择性掩蔽,可以确保在VQA网络的多堆叠结构中保留SELRES中的选择。我们的全部模式在划伤和微调模型上实现了新的最先进的表演。 (c)2020 Elsevier B.v.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号