Deep 3D attention long short-term memory for video-based action recognition
Abstract
A method, a computer program product, and a system are provided for video based action recognition. The system includes a processor. One or more frames from one or more video sequences are received. A feature vector for each patch of the one or more frames is generated using a deep convolutional neural network. An attention factor for the feature vectors is generated based on a within-frame attention and a between-frame attention. A target action is identified using a multi-layer deep long short-term memory process applied to the attention factor, said target action representing at least one of the one or more video sequences. An operation of a processor-based machine is controlled to change a state of the processor-based machine, responsive to the at least one of the one or more video sequences including the identified target action.
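The attention step described above can be illustrated with a minimal sketch. The abstract does not say how the within-frame and between-frame attention are computed, so everything here is an assumption: the linear scoring vectors `w_patch` and `w_frame`, the softmax normalization, and the function name `attend` are hypothetical. The patch feature vectors are taken as given (in the abstract they come from a deep convolutional neural network), and the multi-layer LSTM that would consume the attended sequence is omitted.

```python
import math

def softmax(scores):
    # Numerically stable softmax over a list of floats.
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def attend(video, w_patch, w_frame):
    """video: list of frames; each frame is a list of patch feature vectors.

    w_patch and w_frame are hypothetical scoring vectors (assumptions,
    not taken from the patent abstract).
    """
    frame_summaries = []
    for frame in video:
        # Within-frame attention: weight the patches of one frame against each other.
        alpha = softmax([dot(w_patch, patch) for patch in frame])
        dim = len(frame[0])
        summary = [sum(a * patch[d] for a, patch in zip(alpha, frame))
                   for d in range(dim)]
        frame_summaries.append(summary)
    # Between-frame attention: weight the frame summaries against each other.
    beta = softmax([dot(w_frame, s) for s in frame_summaries])
    # Attention-weighted sequence, one vector per frame, ready for a sequence model.
    attended = [[b * x for x in s] for b, s in zip(beta, frame_summaries)]
    return attended, beta

# Toy input: 2 frames, 2 patches per frame, 2-dimensional features.
video = [
    [[1.0, 0.0], [0.0, 1.0]],
    [[2.0, 0.0], [0.0, 2.0]],
]
attended, beta = attend(video, w_patch=[1.0, 0.0], w_frame=[1.0, 1.0])
```

In this toy input the second frame has larger features, so it receives the larger between-frame weight. In a full pipeline, `attended` would be passed frame by frame to a stacked LSTM whose final state is classified into an action label.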