Visual Code-Sentences: A New Video Representation Based on Image Descriptor Sequences

机译：视觉代码句：基于图像描述符序列的新视频表示

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We present a new descriptor-sequence model for action recognition that enhances discriminative power in the spatio-temporal context, while maintaining robustness against background clutter as well as variability in inter-/intra-person behavior. We extend the framework of Dense Trajectories based activity recognition (Wang et al., 2011) and introduce a pool of dynamic Baye-sian networks (e.g., multiple HMMs) with histogram descriptors as codebooks of composite action categories represented at respective key points. The entire codebooks bound with spatio-temporal interest points constitute intermediate feature representation as basis for generic action categories. This representation scheme is intended to serve as visual code-sentences which subsume a rich vocabulary of basis action categories. Through extensive experiments using KTH, UCF Sports, and Hollywood2 datasets, we demonstrate some improvements over the state-of-the-art methods.

机译：我们提出了一种新的动作识别描述符序列模型，该模型增强了时空背景下的判别能力，同时保持了针对背景混乱的稳健性以及人际/人际行为的可变性。我们扩展了基于密集轨迹的活动识别的框架（Wang等人，2011），并引入了动态贝叶斯网络（例如多个HMM）池，其中直方图描述符作为在各个关键点表示的复合动作类别的代码本。与时空兴趣点绑定的整个密码本构成中间特征表示，作为通用动作类别的基础。此表示方案旨在用作可视代码句，包含大量基础动作类别的词汇。通过使用KTH，UCF Sports和Hollywood2数据集进行的广泛实验，我们展示了对最新方法的一些改进。

著录项

来源
《European conference on computer vision》|2012年|321-331|共11页
会议地点
作者
Yusuke Mitarai; Masakazu Matsugu;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Mosaic representations of video sequences based on slice image analysis [J] . Shaolei Feng, Hanqing Lu, Songde Ma Pattern recognition letters . 2002,第5期

机译：基于切片图像分析的视频序列的马赛克表示
2. Visual Quality Assessment of Video and Image Sequences—A Human-based Approach [J] . Ashraf Al-Najdawi, Roy S. Kalawsky Journal of Signal Processing Systems . 2010,第2期

机译：视频和图像序列的视觉质量评估-基于人的方法
3. Visual Quality Assessment of Video and Image Sequences-A Human-based Approach [J] . Ashraf Al-Najdawi, Roy S. Kalawsky Journal of signal processing systems for signal, image, and video technology . 2010,第2期

机译：视频和图像序列的视觉质量评估-一种基于人的方法
4. Visual Code-Sentences: A New Video Representation Based on Image Descriptor Sequences [C] . Yusuke Mitarai, Masakazu Matsugu European Conference on Computer Vision . 2012

机译：Visual Code-句：基于图像描述符序列的新视频表示
5. Agent-based automated image descriptor approach for visually impaired people. [D] . Hassan, Mohammad Mahdi. 2007

机译：针对视障人士的基于代理的自动图像描述符方法。
6. An effective content-based image retrieval technique for image visuals representation based on the bag-of-visual-words model [O] . Safia Jabeen, Zahid Mehmood, Toqeer Mahmood, -1

机译：基于视觉袋模型的基于内容的有效图像检索技术
7. Figure 2: Similarity scores from human observers and different visual attention models in computing saliency map of traffic videos and images with different presentation sequences. [O] . -1

机译：图2：具有不同呈现序列的交通视频和图像计算显着性图中的人类观察者和不同视觉注意力的相似性分数。

Visual Code-Sentences: A New Video Representation Based on Image Descriptor Sequences

摘要

著录项

相似文献

相关主题

期刊订阅