...
首页> 外文期刊>Image and Vision Computing >Towards optimal VLAD for human action recognition from still images
【24h】

Towards optimal VLAD for human action recognition from still images

机译:寻求从静止图像识别人为动作的最佳VLAD

获取原文
获取原文并翻译 | 示例
           

摘要

Human action recognition from still image has recently drawn increasing attention in human behavior analysis and also poses great challenges due to the huge inter ambiguity and intra variability. Vector of locally aggregated descriptors (VLAD) has achieved state-of-the-art performance in many image classification tasks based on local features. The great success of VLAD is largely due to its high descriptive ability and computational efficiency. In this paper, towards optimal VLAD representations for human action recognition from still images, we improve VLAD by tackling three important issues including empty cavity, ambiguity and pooling strategies. The empty cavity limits the performance of VLAD and has long been overlooked. We investigate the empty cavity and provide an effective solution to deal with it, which improves the performance of VLAD; we enhance the codewords with middle level of assignments which are more reliable and can provide more useful information for realistic activity; we propose incorporating the generalized max pooling to replace sum pooling in VLAD, which is more reliable for the final representation. We have conducted extensive experiments on four widely-used benchmarks to validate the proposed method for human action recognition from still images. Our method produces competitive performance with state-of-the-art algorithms. (C) 2016 Elsevier B.V. All rights reserved.
机译:从静止图像识别人类动作,最近在人类行为分析中引起了越来越多的关注,并且由于巨大的相互歧义性和内部可变性,也提出了巨大的挑战。局部聚集描述符向量(VLAD)在许多基于局部特征的图像分类任务中均实现了最先进的性能。 VLAD的巨大成功很大程度上归因于其高的描述能力和计算效率。在本文中,针对从静止图像中识别人类动作的最佳VLAD表示,我们通过解决三个重要问题(包括空洞,模糊性和合并策略)来改进VLAD。空腔限制了VLAD的性能,长期以来一直被忽略。我们研究了空洞并提供了有效的解决方案,从而改善了VLAD的性能。我们使用中等级别的分配来增强代码字,使其更加可靠,并可以为现实活动提供更多有用的信息;我们建议在VLAD中合并广义最大池以代替和池,这对于最终表示而言更为可靠。我们已经对四个广泛使用的基准进行了广泛的实验,以验证从静止图像中识别人类动作的方法。我们的方法使用最先进的算法可产生具有竞争力的性能。 (C)2016 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号