...
首页> 外文期刊>Neurocomputing >Dual-View Semantic Inference Network for image-text matching
【24h】

Dual-View Semantic Inference Network for image-text matching

机译:用于图像文本匹配的双视图语义推理网络

获取原文
获取原文并翻译 | 示例
           

摘要

Recently, image-text matching based on local region-word semantic alignment has attracted considerable research attention. The fine-grained interplay can be achieved by aggregating the similarities of the region-word pairs. However, the similarities of aligned region-word pairs are treated equally in most cross-modal matching literatures, without considering their respective importance. Moreover, the local alignment methods are prone to bring about a global semantic drift due to the ignorance of thematic considerations for the image-text pairs. In this paper, a novel Dual-View Semantic Inference (DVSI) network is proposed to leverage both local and global semantic matching in a holistic deep framework. For the local view, a region enhancement module is proposed to mine the priorities for different regions in the image, which provides differentiate abilities to discover the latent region-word relationships. For the global view, the overall semantics of image is summarized for global semantic matching to avoid global semantic drift. The two views are unified together for final image-text matching. Extensive experiments conducted on MSCOCO and Flicr30K demonstrate the effectiveness of the proposed DVSI. (C) 2020 Elsevier B.V. All rights reserved.
机译:最近,基于本地区域词语语义对齐的图像文本匹配引起了相当大的研究关注。通过聚合区域字对对的相似性可以实现细粒度的相互作用。然而,对齐区域词对的相似性在大多数跨模型匹配文献中同样地治疗,而不考虑它们各自的重要性。此外,由于图像文本对的专题考虑因素的无知,局部对准方法容易引起全局语义漂移。在本文中,提出了一种新颖的双视图语义推理(DVSI)网络,以利用整体深层框架中的本地和全局语义匹配。对于本地视图,提出了一个区域增强模块来挖掘图像中不同区域的优先级,这提供了发现潜在区域词关系的区分能力。对于全球视图,概述了图像的整体语义,以避免全球语义漂移。两个视图统一,用于最终图像文本匹配。在MSCOCO和FLICR30K上进行的广泛实验证明了所提出的DVSI的有效性。 (c)2020 Elsevier B.v.保留所有权利。

著录项

  • 来源
    《Neurocomputing》 |2021年第22期|47-57|共11页
  • 作者单位

    China Univ Petr East China Coll Comp Sci & Technol Qingdao Peoples R China;

    China Univ Petr East China Coll Comp Sci & Technol Qingdao Peoples R China;

    China Univ Petr East China Coll Comp Sci & Technol Qingdao Peoples R China;

    China Univ Petr Beijing Karamay Sch Petr Engn Beijing Peoples R China;

    China Univ Petr East China Coll Comp Sci & Technol Qingdao Peoples R China;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Image-text matching; Global semantic matching; Local semantic matching;

    机译:图像文本匹配;全局语义匹配;局部语义匹配;
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号