首页> 外文会议>Annual meeting of the Association for Computational Linguistics >No Metrics Are Perfect: Adversarial Reward Learning for Visual Storytelling
【24h】

No Metrics Are Perfect: Adversarial Reward Learning for Visual Storytelling

机译:没有度量标准是完美的:视觉叙事的对抗性奖励学习

获取原文

摘要

Though impressive results have been achieved in visual captioning, the task of generating abstract stories from photo streams is still a little-tapped problem. Different from captions, stories have more expressive language styles and contain many imaginary concepts that do not appear in the images. Thus it poses challenges to behavioral cloning algorithms. Furthermore, due to the limitations of automatic metrics on evaluating story quality, reinforcement learning methods with hand-crafted rewards also face difficulties in gaining an overall performance boost. Therefore, we propose an Adversarial REward Learning (AREL) framework to learn an implicit reward function from human demonstrations, and then optimize policy search with the learned reward function. Though automatic evaluation indicates slight performance boost over state-of-the-art (SOTA) methods in cloning expert behaviors, human evaluation shows that our approach achieves significant improvement in generating more human-like stories than SOTA systems. Code will be made available here.
机译:尽管在视觉字幕上取得了令人印象深刻的结果,但是从照片流生成抽象故事的任务仍然是一个尚未开发的问题。与字幕不同,故事具有更具表现力的语言风格,并且包含许多未出现在图像中的虚构概念。因此,它对行为克隆算法提出了挑战。此外,由于自动度量标准在评估故事质量方面的局限性,具有手工制作奖励的强化学习方法在获得整体绩效提升方面也面临着困难。因此,我们提出了一种对抗性奖励学习(AREL)框架,以从人类的示威中学习隐式的奖励功能,然后利用所学习的奖励功能来优化策略搜索。尽管自动评估表明在克隆专家行为方面与最新技术(SOTA)方法相比性能略有提高,但是人类评估表明,与SOTA系统相比,我们的方法在生成更多类似人的故事方面取得了显着改进。代码将在此处提供。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号