首页> 外文会议>IEEE International Conference on Big Data and Smart Computing >Static Correlative Filter Based Convolutional Neural Network for Visual Question Answering
【24h】

Static Correlative Filter Based Convolutional Neural Network for Visual Question Answering

机译:基于静态相关滤波器的卷积神经网络用于视觉问答

获取原文

摘要

Visual Question Answering (VQA) has received increasing attentions due to the success of computer vision and natural language processing. The computer is required to understand the image, comprehend and reply to the question. The data modal of images makes it harder to answer than textual questions. In general, as VQA tasks use Convolutional Neural Networks (CNN) to extract image features, a better CNN model is preferred for obtaining better image representations. In this paper, the Static Correlative Filter (SCF) which is an advanced technique in convolutional layers is employed for VQA, as convolutional layer is the major component of CNN. The effectiveness of SCF for VQA is demonstrated by the experiments on the benchmark dataset of COCO-QA with two baseline image question answering models.
机译:由于计算机视觉和自然语言处理的成功,视觉问答(VQA)受到越来越多的关注。要求计算机理解图像,理解并回答问题。图像的数据模式比文本问题更难回答。通常,由于VQA任务使用卷积神经网络(CNN)提取图像特征,因此,最好使用更好的CNN模型来获得更好的图像表示。由于卷积层是CNN的主要组成部分,因此本文将卷积层中的一种先进技术静态相关滤波器(SCF)用于VQA。通过在COCO-QA基准数据集上使用两个基准图像问题回答模型进行的实验,证明了SCF对VQA的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号