Learning Video Actions in Two Stream Recurrent Neural Network

Hassan Ehtesham


Abstract

This paper investigates long short-term memory (LSTM) networks for human action recognition in videos. Despite significant progress in the field, recognizing actions in real-world videos remains challenging because of the spatial and temporal variations within and across video clips. We propose a novel two-stream deep network for action recognition that uses an LSTM to learn the fusion of spatial and temporal feature streams. The LSTM, a type of recurrent neural network, is designed to preserve long-range context in temporal streams. The proposed method capitalizes on the LSTM's memory to fuse the input streams in a high-dimensional space, exploring their spatial and temporal correlations. The temporal stream input is defined on LSTM-learned deep features that summarize the input frame sequence. Combining the spatial stream, based on convolutional features, with the temporal stream, based on deep features, in an LSTM network efficiently captures the long-range temporal dependencies in video streams. We perform a primary evaluation of the proposed approach on the UCF101, HMDB51 and Kinetics400 datasets, achieving competitive recognition accuracies of 93.1%, 71.3% and 74.6%, respectively. (c) 2021 Elsevier B.V. All rights reserved.
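The fusion idea described in the abstract can be illustrated with a minimal NumPy sketch. This is not the paper's implementation: the abstract does not specify the fusion operator, layer sizes, or training procedure. The sketch assumes the two streams are fused by concatenating per-frame feature vectors before a single LSTM layer, and it uses random arrays as stand-ins for the convolutional (spatial) and LSTM-learned (temporal) features. All names (`lstm_step`, `fuse_streams`, `W`, `U`, `b`, `W_out`) and all dimensions are hypothetical.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def lstm_step(x, h, c, W, U, b):
    """One LSTM step; gate pre-activations are stacked as [input, forget, cell, output]."""
    z = W @ x + U @ h + b
    i, f, g, o = np.split(z, 4)
    c_new = sigmoid(f) * c + sigmoid(i) * np.tanh(g)  # memory cell keeps long-range context
    h_new = sigmoid(o) * np.tanh(c_new)
    return h_new, c_new

def fuse_streams(spatial, temporal, W, U, b):
    """Run an LSTM over per-frame concatenations of the two feature streams.

    Concatenation is an assumption made for this sketch, not a detail from the abstract.
    """
    hidden = U.shape[1]
    h = np.zeros(hidden)
    c = np.zeros(hidden)
    for s_t, m_t in zip(spatial, temporal):
        x_t = np.concatenate([s_t, m_t])  # fused input for frame t
        h, c = lstm_step(x_t, h, c, W, U, b)
    return h  # final hidden state summarizes the clip

# Hypothetical sizes; random features stand in for real stream outputs.
rng = np.random.default_rng(0)
T, d_spatial, d_temporal, hidden, n_classes = 8, 16, 12, 10, 5
spatial = rng.standard_normal((T, d_spatial))
temporal = rng.standard_normal((T, d_temporal))
W = 0.1 * rng.standard_normal((4 * hidden, d_spatial + d_temporal))
U = 0.1 * rng.standard_normal((4 * hidden, hidden))
b = np.zeros(4 * hidden)
W_out = 0.1 * rng.standard_normal((n_classes, hidden))

fused = fuse_streams(spatial, temporal, W, U, b)
logits = W_out @ fused
probs = np.exp(logits - logits.max())  # numerically stable softmax over action classes
probs /= probs.sum()
```

With untrained random weights the class probabilities carry no meaning; the sketch only shows how a single recurrent state can accumulate information from both streams across frames.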

Bibliographic record

Source
Pattern Recognition Letters | 2021, Issue 11 | pp. 200-208 | 9 pages
Author
Hassan Ehtesham
Affiliation

Kuwait Coll Sci & Technol, Dept Comp Sci & Engn, Kuwait, Kuwait;

Indexed in: Science Citation Index (SCI), USA; Engineering Index (EI), USA
Original format: PDF
Language: English
Keywords
Action recognition; Two-stream deep network; LSTM; Feature fusion;
