首页> 外文期刊>Signal Processing. Image Communication: A Publication of the the European Association for Signal Processing >A human action recognition approach with a novel reduced feature set based on the natural domain knowledge of the human figure
【24h】

A human action recognition approach with a novel reduced feature set based on the natural domain knowledge of the human figure

机译:一种基于人类人物自然领域知识的具有新颖的简化特征集的人类动作识别方法

获取原文
获取原文并翻译 | 示例
           

摘要

Current video surveillance systems are not designed to raise an automatic alert in case of situations that put people lives at risk such as accidents, assaults and terrorism among others. This is due to the fact that these systems are not able to analyze huge amounts of video signals at higher processing speed where these signals come from cameras installed in the worldwide network. Faced with this situation, scientific communities are combining efforts to design algorithms and hardware to accelerate the processing of video signals. However, most of the methods proposed to date are too complex to be implemented in hardware at the place where the video camera is installed. In this paper, we report a significantly reduced novel feature set to design an analysis algorithm of significant less complexity which recognizes human actions from video sequences. The proposed method is based on the natural domain knowledge of the human figure such as proportions of the human body and foot positions. The analysis is characterized by working on sub-sequences of the entire video signals, processing a small fragment of the whole image, estimating the location of the region of interest, using simple operations (sum, subtraction, multiplications, divisions), extracting a reduced number of features per frame (6 features), and using a combination of four linear classifiers (one perceptron and three support vector machines) with a hierarchical structure. The method is evaluated on two of the datasets cited in the human action recognition literature, the Weizmann and the UIUC datasets. Results show that for the case of the Weizmann dataset, the correct classification rate (CCR) is 99.95% when the LOOCV Protocol is used and 98.38% for the case of Protocol 60-40, which is comparable or even higher than that of current state-of-the-art methods. Confusion matrices were also obtained for the UIUC dataset, where the obtained CCR is 100% for the case of the LOOCV Protocol and 99.35% when Protocol 60-40 is used. The experimental results are promising with much fewer features (between 85 and 113 times less features), compared with other methods, and the possibility of processing more than 200 fps. (C) 2014 Elsevier B.V. All rights reserved.
机译:当前的视频监视系统并未设计为在发生使人的生命处于危险之中的情况(例如事故,袭击和恐怖主义等)时发出自动警报。这是由于以下事实:这些系统无法以更高的处理速度分析大量视频信号,而这些信号来自安装在全球网络中的摄像机。面对这种情况,科学界正在共同努力设计算法和硬件,以加速视频信号的处理。但是,迄今为止提出的大多数方法都太复杂了,无法在安装摄像机的地方以硬件实现。在本文中,我们报告了一种大大简化的新颖功能集,旨在设计一种复杂度大大降低的分析算法,该算法可识别视频序列中的人类动作。所提出的方法基于人的自然领域知识,例如人体的比例和脚的位置。该分析的特征是处理整个视频信号的子序列,处理整个图像的一小部分,使用简单的操作(求和,减法,乘法,除法)估计感兴趣区域的位置,提取出一个每帧特征数(6个特征),并结合使用具有分层结构的四个线性分类器(一个感知器和三个支持向量机)。该方法是在人类动作识别文献中引用的两个数据集(Weizmann和UIUC数据集)上进行评估的。结果表明,对于Weizmann数据集,使用LOOCV协议时的正确分类率(CCR)为99.95%,而对于协议60-40则为98.38%,这与当前状态相当或什至更高最先进的方法。还为UIUC数据集获得了混淆矩阵,其中对于LOOCV协议,获得的CCR为100%,而当使用协议60-40时,则为99.35%。与其他方法相比,实验结果具有更少的功能(功能减少了85到113倍),并且处理速度超过200 fps的前景令人鼓舞。 (C)2014 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号