IEEE Winter Conference on Applications of Computer Vision

Weakly Supervised Deep Reinforcement Learning for Video Summarization With Semantically Meaningful Reward



Abstract

Conventional unsupervised video summarization algorithms are usually developed in a frame-level clustering manner. For example, frame-level diversity and representativeness are two typical clustering criteria used for unsupervised reinforcement learning-based video summarization. Inspired by recent progress in video representation techniques, we further introduce the similarity of video representations to construct a semantically meaningful reward for this task. We consider that a good summarization should also be semantically identical to its original source, which means that the semantic similarity can be regarded as an additional criterion for summarization. By combining a novel video semantic reward with other unsupervised rewards for training, we can easily upgrade an unsupervised reinforcement learning-based video summarization method to its weakly supervised version. In practice, we first train a video classification sub-network (VCSN) to extract video semantic representations based on a category-labeled video dataset. Then we fix this VCSN and train a summary generation sub-network (SGSN) on unlabeled video data in a reinforcement learning manner. Experimental results demonstrate that our work significantly surpasses other unsupervised and even supervised methods. To the best of our knowledge, our method achieves state-of-the-art performance in terms of the correlation coefficients, Kendall's τ and Spearman's ρ.
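The abstract's central idea is to add a semantic-similarity term to the usual unsupervised rewards (diversity and representativeness) when training the summarizer with reinforcement learning. The sketch below is a minimal illustration of that reward combination, not the authors' implementation: the cosine-similarity form of the semantic reward, the specific diversity and representativeness formulas, the weight w_sem, and the assumption that the frozen VCSN maps a (T, D) frame-feature sequence to a single video-level embedding are all assumptions made for illustration.

```python
# Minimal sketch (not the authors' released code) of mixing a semantic reward
# with diversity and representativeness rewards for RL-based video
# summarization. `pick_mask` is assumed to be a boolean (T,) tensor marking
# the frames the SGSN selected for the summary.
import torch
import torch.nn.functional as F


def semantic_reward(vcsn, frame_feats, pick_mask):
    """Cosine similarity between the VCSN embeddings of the full video
    and of the selected summary frames (hypothetical formulation)."""
    full_emb = vcsn(frame_feats)             # video-level embedding of all frames
    summ_emb = vcsn(frame_feats[pick_mask])  # video-level embedding of the summary
    return F.cosine_similarity(full_emb, summ_emb, dim=-1).squeeze()


def diversity_reward(frame_feats, pick_mask):
    """One minus the mean pairwise cosine similarity among selected frames."""
    sel = frame_feats[pick_mask]
    n = sel.size(0)
    if n < 2:
        return frame_feats.new_tensor(0.0)
    sim = F.cosine_similarity(sel.unsqueeze(1), sel.unsqueeze(0), dim=-1)
    off_diag = sim.sum() - sim.diagonal().sum()
    return 1.0 - off_diag / (n * (n - 1))


def representativeness_reward(frame_feats, pick_mask):
    """exp(-mean distance) from each frame to its nearest selected frame."""
    sel = frame_feats[pick_mask]
    if sel.size(0) == 0:
        return frame_feats.new_tensor(0.0)
    dists = torch.cdist(frame_feats, sel)    # (T, K) pairwise distances
    return torch.exp(-dists.min(dim=1).values.mean())


def total_reward(vcsn, frame_feats, pick_mask, w_sem=1.0):
    """Sum of the unsupervised rewards and the weighted semantic reward,
    used as the return signal for training the SGSN."""
    return (diversity_reward(frame_feats, pick_mask)
            + representativeness_reward(frame_feats, pick_mask)
            + w_sem * semantic_reward(vcsn, frame_feats, pick_mask))
```

In the setup described by the abstract, the VCSN is pre-trained on a category-labeled dataset and then frozen, so only the SGSN's selection policy is updated from this combined reward; the policy-gradient details of that update are not specified in the abstract and are left out of the sketch.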
