Conference: International Joint Conference on Natural Language Processing; Conference on Empirical Methods in Natural Language Processing

YouMakeup: A Large-Scale Domain-Specific Multimodal Dataset for Fine-Grained Semantic Comprehension


Abstract

Multimodal semantic comprehension, including tasks such as visual question answering and caption generation, has attracted increasing research interest in recent years. However, due to data limitations, fine-grained semantic comprehension, which requires capturing the semantic details of multimodal content, has not been well investigated. In this work, we introduce "YouMakeup", a large-scale multimodal instructional video dataset to support fine-grained semantic comprehension research in a specific domain. YouMakeup contains 2,800 videos from YouTube, spanning more than 420 hours in total. Each video is annotated with a sequence of natural language descriptions of instructional steps, grounded in temporal video ranges and spatial facial areas. The annotated steps in a video involve subtle differences in actions, products and regions, which require fine-grained understanding and reasoning both temporally and spatially. To evaluate models' ability for fine-grained comprehension, we further propose two groups of tasks from different aspects: generation tasks and visual question answering tasks. We also establish a baseline for step caption generation for future comparison.
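To make the annotation structure described in the abstract concrete, the following is a minimal Python sketch of one way such a record could be represented. It is an illustration only: the class names, field names, and the example record are hypothetical and are not taken from the released dataset, whose actual schema may differ. The last line works out the lower bound on mean video length implied by the stated totals (2,800 videos, more than 420 hours).

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class StepAnnotation:
    # Hypothetical field names; the released annotation schema may differ.
    caption: str        # natural language description of the step
    start_sec: float    # start of the temporal range in the video
    end_sec: float      # end of the temporal range in the video
    face_regions: List[str] = field(default_factory=list)  # spatial grounding

    def duration(self) -> float:
        """Length of the step's temporal range in seconds."""
        return self.end_sec - self.start_sec


@dataclass
class VideoAnnotation:
    video_id: str
    steps: List[StepAnnotation] = field(default_factory=list)

    def is_well_formed(self) -> bool:
        """True if every step has a positive-length range and steps are ordered by start time."""
        starts = [s.start_sec for s in self.steps]
        return all(s.duration() > 0 for s in self.steps) and starts == sorted(starts)


# Made-up example record, not taken from the dataset.
example = VideoAnnotation(
    video_id="demo_video",
    steps=[
        StepAnnotation("apply foundation on the cheeks with a brush", 12.0, 30.5, ["cheeks"]),
        StepAnnotation("apply concealer under the eyes", 31.0, 44.0, ["eyes"]),
    ],
)

# Lower bound on mean video length (minutes) implied by the abstract's totals.
mean_minutes = 420 * 60 / 2800
```

Because the abstract states "more than 420 hours", the value computed in `mean_minutes` is a lower bound on the average video length rather than an exact figure.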

