Conference: European Conference on Computer Vision

Amplifying Key Cues for Human-Object-Interaction Detection

Abstract

Human-object interaction (HOI) detection aims to detect and recognise how people interact with the objects that surround them. This is challenging as different interaction categories are often distinguished only by very subtle visual differences in the scene. In this paper we introduce two methods to amplify key cues in the image, and also a method to combine these and other cues when considering the interaction between a human and an object. First, we introduce an encoding mechanism for representing the fine-grained spatial layout of the human and object (a subtle cue) and also semantic context (a cue, represented by text embeddings of surrounding objects). Second, we use plausible future movements of humans and objects as a cue to constrain the space of possible interactions. Third, we use a gate and memory architecture as a fusion module to combine the cues. We demonstrate that these three improvements lead to a performance which exceeds prior HOI methods across standard benchmarks by a considerable margin.
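
To make the idea of a spatial-layout cue concrete, here is a minimal sketch of one simple way to describe where an object sits relative to a person. It is not the encoding introduced in the paper, which the abstract does not specify. The function names (`box_area`, `iou`, `pairwise_layout`), the `(x1, y1, x2, y2)` box format, and the choice of five features are all assumptions made for illustration.

```python
from typing import List, Tuple

# Assumed box format: (x1, y1, x2, y2) in pixels, with x2 > x1 and y2 > y1.
Box = Tuple[float, float, float, float]


def box_area(box: Box) -> float:
    """Area of a box; degenerate or inverted boxes count as zero."""
    x1, y1, x2, y2 = box
    return max(0.0, x2 - x1) * max(0.0, y2 - y1)


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two boxes."""
    inter = box_area((max(a[0], b[0]), max(a[1], b[1]),
                      min(a[2], b[2]), min(a[3], b[3])))
    union = box_area(a) + box_area(b) - inter
    return inter / union if union > 0 else 0.0


def pairwise_layout(human: Box, obj: Box) -> List[float]:
    """Hypothetical layout feature for one human-object pair.

    Returns [dx, dy, w_ratio, h_ratio, overlap], where dx and dy are the
    object-centre offsets from the human centre, scaled by the human box
    size so the feature does not depend on image resolution.
    """
    hw, hh = human[2] - human[0], human[3] - human[1]
    ow, oh = obj[2] - obj[0], obj[3] - obj[1]
    if hw <= 0 or hh <= 0:
        raise ValueError("human box must have positive width and height")
    hcx, hcy = (human[0] + human[2]) / 2, (human[1] + human[3]) / 2
    ocx, ocy = (obj[0] + obj[2]) / 2, (obj[1] + obj[3]) / 2
    return [(ocx - hcx) / hw, (ocy - hcy) / hh, ow / hw, oh / hh,
            iou(human, obj)]


if __name__ == "__main__":
    person = (0.0, 0.0, 10.0, 20.0)
    cup = (5.0, 10.0, 15.0, 20.0)
    print(pairwise_layout(person, cup))
```

A vector like this could be passed to a fusion module alongside appearance and semantic-context features; how the paper actually combines its cues is described only as a gate and memory architecture.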
