首页> 外文会议>IEEE International Conference on Big Data and Smart Computing >Static Correlative Filter Based Convolutional Neural Network for Visual Question Answering
【24h】

Static Correlative Filter Based Convolutional Neural Network for Visual Question Answering

机译:基于静态相关滤波器的卷积神经网络,用于视觉问题应答

获取原文

摘要

Visual Question Answering (VQA) has received increasing attentions due to the success of computer vision and natural language processing. The computer is required to understand the image, comprehend and reply to the question. The data modal of images makes it harder to answer than textual questions. In general, as VQA tasks use Convolutional Neural Networks (CNN) to extract image features, a better CNN model is preferred for obtaining better image representations. In this paper, the Static Correlative Filter (SCF) which is an advanced technique in convolutional layers is employed for VQA, as convolutional layer is the major component of CNN. The effectiveness of SCF for VQA is demonstrated by the experiments on the benchmark dataset of COCO-QA with two baseline image question answering models.
机译:由于计算机视觉和自然语言处理的成功,视觉问题应答(VQA)已收到增加的注意。需要计算机以了解图像,理解和回复问题。数据模态的图像模态使得难以回答而不是文本问题。通常,随着VQA任务使用卷积神经网络(CNN)来提取图像特征,优选更好的CNN模型来获得更好的图像表示。本文采用稳定相关滤波器(SCF),其是卷积层中的先进技术的VQA,因为卷积层是CNN的主要成分。通过两个基线图像问题应答模型的Coco-QA基准数据集的实验证明了SCF对VQA的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号