Conference: IEEE International Conference on Multimedia and Expo

Combine Early and Late Fusion Together: A Hybrid Fusion Framework for Image-Text Matching

Abstract

Image-text matching is a challenging task in cross-modal learning because images and texts are represented in very different ways. Mainstream methods adopt late fusion, computing image-text similarity on separately encoded cross-modal features, and devote effort to capturing intra-modality associations at considerably high training cost. In this work, we propose to Combine Early and Late Fusion Together (CELFT), a universal hybrid fusion framework that can effectively overcome these shortcomings of the late fusion scheme. In the proposed CELFT framework, the hybrid structure with early fusion and late fusion facilitates interaction between the image and text modalities at an early stage. Moreover, the two fusion strategies complement each other in capturing inter-modal and intra-modal information, which helps the model learn a more accurate image-text similarity. In the experiments, we choose four recent approaches based on the late fusion scheme as base models and integrate them with our CELFT framework. The results on two widely used image-text datasets, MSCOCO and Flickr30K, show that the matching performance of all base models is significantly improved with remarkably reduced training time.
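The distinction the abstract draws between late and early fusion can be illustrated with a small NumPy sketch. This is not the CELFT architecture, which the abstract does not specify. It assumes pre-extracted region and word features of the same dimension, uses mean pooling with cosine similarity as a stand-in for late fusion, uses a simple word-to-region attention as a stand-in for early fusion, and mixes the two scores with a hypothetical weight `alpha`. All function and parameter names are illustrative.

```python
import numpy as np


def l2_normalize(x, axis=-1, eps=1e-8):
    """Scale vectors along `axis` to (approximately) unit length."""
    return x / (np.linalg.norm(x, axis=axis, keepdims=True) + eps)


def late_fusion_score(regions, words):
    """Late fusion stand-in: pool each modality on its own, then compare."""
    img = l2_normalize(regions.mean(axis=0))
    txt = l2_normalize(words.mean(axis=0))
    return float(img @ txt)


def early_fusion_score(regions, words, temperature=0.1):
    """Early fusion stand-in: words attend over regions before scoring."""
    r = l2_normalize(regions)                      # (n_regions, dim)
    w = l2_normalize(words)                        # (n_words, dim)
    logits = (w @ r.T) / temperature               # (n_words, n_regions)
    logits = logits - logits.max(axis=1, keepdims=True)  # numerical stability
    attn = np.exp(logits)
    attn = attn / attn.sum(axis=1, keepdims=True)  # softmax over regions
    attended = l2_normalize(attn @ r)              # image context per word
    return float(np.mean(np.sum(attended * w, axis=1)))


def hybrid_score(regions, words, alpha=0.5):
    """Hypothetical hybrid: weighted mix of the early and late scores."""
    early = early_fusion_score(regions, words)
    late = late_fusion_score(regions, words)
    return alpha * early + (1.0 - alpha) * late


# Usage with random stand-in features (5 regions, 7 words, 16 dimensions).
rng = np.random.default_rng(0)
regions = rng.normal(size=(5, 16))
words = rng.normal(size=(7, 16))
print(hybrid_score(regions, words))
```

In the late fusion path the two modalities never see each other until the final dot product, whereas in the early fusion path each word's score depends on which regions it attends to. A trained system would learn the encoders and the way the two signals are combined rather than fixing `alpha` by hand.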
