IEEE Conference on Computer Vision and Pattern Recognition

Predicting the Where and What of actors and actions through Online Action Localization


Abstract

This paper proposes a novel approach to the challenging problem of 'online action localization', which entails predicting actions and their locations as they happen in a video. Typically, action localization or recognition is performed in an offline manner, where all the frames in the video are processed together and action labels are not predicted for the future. This precludes timely localization of actions, an important consideration for surveillance tasks. In our approach, given a batch of frames from the immediate past in a video, we estimate pose and over-segment the current frame into superpixels. Next, we discriminatively train an actor foreground model on the superpixels using the pose bounding boxes. A Conditional Random Field, with superpixels as nodes and edges connecting spatio-temporal neighbors, is used to obtain action segments. The action confidence is predicted using dynamic programming on SVM scores obtained on short segments of the video, thereby capturing sequential information of the actions. The issue of visual drift is handled by updating the appearance model and refining the pose in an online manner. Lastly, we introduce a new measure to quantify the performance of action prediction (i.e. online action localization), which analyzes how the prediction accuracy varies as a function of the observed portion of the video. Our experiments suggest that despite using only a few frames to localize actions at each time instant, we are able to predict the action and obtain results competitive with state-of-the-art offline methods.
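The abstract mentions predicting action confidence via dynamic programming over SVM scores computed on short video segments, but does not give the recurrence. A minimal sketch of one plausible Viterbi-style accumulation is below; the function name, the `(T, K)` score layout, and the `switch_penalty` smoothing constant are all assumptions for illustration, not the paper's actual formulation.

```python
import numpy as np

def online_action_confidence(svm_scores, switch_penalty=0.5):
    """Sketch: accumulate per-segment SVM scores with dynamic programming.

    svm_scores: (T, K) array with one row per short video segment and
    one column per action class. dp[t, k] holds the best cumulative
    score ending at segment t with class k, where the class may either
    persist from t-1 or switch (paying an assumed smoothing penalty).
    Returns the dp table and the currently predicted class index.
    """
    T, K = svm_scores.shape
    dp = np.zeros((T, K))
    dp[0] = svm_scores[0]
    for t in range(1, T):
        stay = dp[t - 1]                           # keep the same class label
        switch = dp[t - 1].max() - switch_penalty  # switch from the best class
        dp[t] = svm_scores[t] + np.maximum(stay, switch)
    return dp, int(dp[-1].argmax())
```

Because each step depends only on the previous row of `dp`, the confidence can be updated online as new segments arrive, matching the streaming setting the paper targets.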
