首页> 外文会议>European conference on computer vision >Visual Code-Sentences: A New Video Representation Based on Image Descriptor Sequences
【24h】

Visual Code-Sentences: A New Video Representation Based on Image Descriptor Sequences

机译:视觉代码句:基于图像描述符序列的新视频表示

获取原文

摘要

We present a new descriptor-sequence model for action recognition that enhances discriminative power in the spatio-temporal context, while maintaining robustness against background clutter as well as variability in inter-/intra-person behavior. We extend the framework of Dense Trajectories based activity recognition (Wang et al., 2011) and introduce a pool of dynamic Baye-sian networks (e.g., multiple HMMs) with histogram descriptors as codebooks of composite action categories represented at respective key points. The entire codebooks bound with spatio-temporal interest points constitute intermediate feature representation as basis for generic action categories. This representation scheme is intended to serve as visual code-sentences which subsume a rich vocabulary of basis action categories. Through extensive experiments using KTH, UCF Sports, and Hollywood2 datasets, we demonstrate some improvements over the state-of-the-art methods.
机译:我们提出了一种新的动作识别描述符序列模型,该模型增强了时空背景下的判别能力,同时保持了针对背景混乱的稳健性以及人际/人际行为的可变性。我们扩展了基于密集轨迹的活动识别的框架(Wang等人,2011),并引入了动态贝叶斯网络(例如多个HMM)池,其中直方图描述符作为在各个关键点表示的复合动作类别的代码本。与时空兴趣点绑定的整个密码本构成中间特征表示,作为通用动作类别的基础。此表示方案旨在用作可视代码句,包含大量基础动作类别的词汇。通过使用KTH,UCF Sports和Hollywood2数据集进行的广泛实验,我们展示了对最新方法的一些改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号