No Metrics Are Perfect: Adversarial Reward Learning for Visual Storytelling

Abstract

Though impressive results have been achieved in visual captioning, the task of generating abstract stories from photo streams is still a little-tapped problem. Different from captions, stories have more expressive language styles and contain many imaginary concepts that do not appear in the images. Thus it poses challenges to behavioral cloning algorithms. Furthermore, due to the limitations of automatic metrics on evaluating story quality, reinforcement learning methods with hand-crafted rewards also face difficulties in gaining an overall performance boost. Therefore, we propose an Adversarial REward Learning (AREL) framework to learn an implicit reward function from human demonstrations, and then optimize policy search with the learned reward function. Though automatic evaluation indicates slight performance boost over state-of-the-art (SOTA) methods in cloning expert behaviors, human evaluation shows that our approach achieves significant improvement in generating more human-like stories than SOTA systems. Code will be made available here.

Bibliographic details

Source
Annual Meeting of the Association for Computational Linguistics | 2018 | pp. 899-909 | 11 pages
Authors
Xin Wang; Wenhu Chen; Yuan-Fang Wang; William Yang Wang
Original format: PDF

Similar literature


1. The reward of seeing: Different types of visual reward and their ability to modify oculomotor learning [J]. Annegret Meermeier, Svenja Gremmler, Kerstin Richert. Journal of Vision. 2017, No. 12
2. Decision tree pairwise metric learning against adversarial attacks [J]. Benjamin Appiah, Zhiguang Qin, Ayidzoe Mighty Abra. Computers & Security. 2021, issue Jula
3. Boosting Unconstrained Palmprint Recognition With Adversarial Metric Learning [J]. Jinsong Zhu, Dexing Zhong, Kai Luo. IEEE Transactions on Biometrics, Behavior, and Identity Science. 2020, No. 4
4. No Metrics Are Perfect: Adversarial Reward Learning for Visual Storytelling [C]. Xin Wang, Wenhu Chen, Yuan-Fang Wang. Annual Meeting of the Association for Computational Linguistics. 2018
5. Roles of reward, memory, and cognitive control on visual perceptual learning and decision-making [D]. Kim, Dongho. 2013
6. Learning Perfectly Secure Cryptography to Protect Communications with Adversarial Neural Cryptography [O]. Murilo Coutinho, Robson de Oliveira Albuquerque, Fábio Borges. 2018
7. No Metrics Are Perfect: Adversarial Reward Learning for Visual Storytelling [O]. Xin Wang, Wenhu Chen, Yuan-Fang Wang. 2018
