Reinforcement Learning on Video Summarization with Hierarchical Structure

机译：具有层次结构的视频汇总强化学习

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Conventional video summarization approaches based on reinforcement learning have the problem that the reward can only be received after the whole summary is generated. Such kind of reward is sparse and it makes reinforcement learning hard to converge. Another problem is that labelling each shot is tedious and costly, which usually prohibits the construction of large-scale datasets. To solve these problems, we propose a weakly supervised hierarchical reinforcement learning framework, which decomposes the whole task into several subtasks to enhance the summarization quality. This framework consists of a manager network and a worker network. For each subtask, the manager is trained to set a subgoal only by a task-level binary label, which requires much fewer labels than conventional approaches. With the guide of the subgoal, the worker predicts the importance scores for video shots in the subtask by policy gradient according to both global reward and innovative defined sub-rewards to overcome the sparse problem. Experiments on two benchmark datasets show that our proposal has achieved the best performance, even better than supervised approaches.

机译：传统的基于强化学习的视频总结方法存在的问题是，只有在生成整个总结后才能获得奖励。这种奖励很少，并且使强化学习难以融合。另一个问题是标记每个镜头很繁琐且昂贵，这通常会禁止构建大规模数据集。为了解决这些问题，我们提出了一个弱监督的分层强化学习框架，该框架将整个任务分解为几个子任务，以提高摘要质量。该框架由管理者网络和工作者网络组成。对于每个子任务，管理人员仅通过任务级二进制标签就可以设置子目标，与常规方法相比，此标签所需的标签要少得多。在子目标的指导下，工作人员可以根据全局奖励和创新定义的子奖励，通过策略梯度来预测子任务中视频镜头的重要性得分，以克服稀疏问题。在两个基准数据集上进行的实验表明，我们的建议取得了最佳性能，甚至优于监督方法。

著录项

来源
《マルチメディアストレージ研究会;メディア工学研究会;ヒューマンインフォメーション研究会;映像表現コンピュータグラフィックス研究会;ITS研究会;画像工学研究会》|2020年|305-310|共6页
会议地点
作者
Yiyan CHEN; Li TAO; Xueting WANG; Toshihiko YAMASAKI;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Video summarization; hierarchical reinforcement learning;

机译：视频摘要;分层强化学习;

相似文献

外文文献
中文文献
专利

1. Reinforcement Learning on Video Summarization with Hierarchical Structure [J] . 電子情報通信学会技術研究報告. ITS. Intelligent Transport Systems Technology . 2019,第421期

机译：钢筋综合与层次结构的概要学习
2. Crowd aware summarization of surveillance videos by deep reinforcement learning [J] . Junfeng Xu, Zhengxing Sun, Chen Ma Multimedia Tools and Applications . 2021,第4期

机译：通过深度加强学习，人群意识到监视视频的概述
3. A hierarchical self-attentive neural extractive summarizer via reinforcement learning (HSASRL) [J] . Mohsen Farida, Wang Jiayang, Al-Sabahi Kamal Applied Intelligence: The International Journal of Artificial Intelligence, Neural Networks, and Complex Problem-Solving Technologies . 2020,第9期

机译：通过加固学习（HSASRL）的分层自闭门性神经诱惑摘要
4. Self-Attention Recurrent Summarization Network with Reinforcement Learning for Video Summarization Task [C] . Aniwat Phaphuangwittayakul, Yi Guo, Fangli Ying, IEEE International Conference on Multimedia and Expo . 2021

机译：视频摘要任务加固学习的自我关注经常性摘要网络
5. Abstractive Text Summarization Using Hierarchical Reinforcement Learning [D] . Koupaee, Mahnaz 2018

机译：使用分层强化学习的抽象文本摘要
6. Computational evidence for hierarchically structured reinforcement learning in humans [O] . Maria K. Eckstein, Anne G. E. Collins 2020

机译：人类分层结构强化学习的计算证据
7. Self-Attention Recurrent Summarization Network with Reinforcement Learning for Video Summarization Task [O] . Aniwat Phaphuangwittayakul, Yi Guo, Fangli Ying, 2021

机译：视频摘要任务加固学习的自我关注经常性摘要网络

Reinforcement Learning on Video Summarization with Hierarchical Structure

摘要

著录项

相似文献

相关主题

期刊订阅