首页> 外文会议>International Conference of the Cross-Language Evaluation Forum for European Languages >An Effective Deep Transfer Learning and Information Fusion Framework for Medical Visual Question Answering
【24h】

An Effective Deep Transfer Learning and Information Fusion Framework for Medical Visual Question Answering

机译:一种有效的深度转移学习和信息融合框架,用于医学视觉问题应答

获取原文

摘要

Medical visual question answering (Med-VQA) is very important for better clinical decision support and enhanced patient engagement in patient-centered medical care. Compared with open domain VQA tasks, VQA in medical domain becomes more challenging due to limited training resources as well as unique characteristics on medical images and domain vocabularies. In this paper, we propose and develop a novel deep transfer learning model, ETM-Trans, which exploits embedding topic modeling (ETM) on textual questions to derive topic labels to pair with associated medical images for finetuning the pre-trained ImageNet model. We also explore and implement a co-attention mechanism where residual networks is used to extract visual features from image interacting with the long-short term memory (LSTM) based question representation providing fine-grained contextual information for answer derivation. To efficiently integrate visual features from the image and textual features from the question, we employ Multimodal Factor-ized Bilinear (MFB) pooling as well as Multimodal Factorized High-order (MFH) pooling. The ETM-Trans model won the international Med-VQA 2018 challenge, achieving the best WBSS score of 0.186.
机译:医学视觉问题应答(Med-VQA)对于更好的临床决策支持非常重要,并提高患者以患者为中心的医疗的患者参与。与开放式域VQA任务相比,由于有限的培训资源以及医学图像和域名词汇表的独特特征,医学领域的VQA变得更具挑战性。在本文中,我们提出并开发了一种新的深度转移学习模型,ETM-Trans,它在文本问题上利用嵌入主题建模(ETM),以导出主题标签与相关的医学图像配对,以便训练预先训练的Imagenet模型。我们还探索和实施,其中残留的网络被用来提取图像视觉特征与基于长短期记忆(LSTM)问题表示提供答案推导细粒度的上下文信息交互的共同关注机制。为了从问题的图像和文本特征中有效地集成视觉特征,我们采用多模式因子-Ized Bilinear(MFB)池以及多模式分解高阶(MFH)汇集。 ETM-Trans Model赢得了国际MED-VQA 2018挑战,实现了0.186的最佳WBSS得分。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号