首页> 外文期刊>IEEE Transactions on Image Processing >Fast A3RL: Aesthetics-Aware Adversarial Reinforcement Learning for Image Cropping
【24h】

Fast A3RL: Aesthetics-Aware Adversarial Reinforcement Learning for Image Cropping

机译:快速的A3RL:用于图像裁剪的审美意识对抗增强学习

获取原文
获取原文并翻译 | 示例
           

摘要

Image cropping aims at improving the quality of images by removing unwanted outer areas, which is widely used in the photography and printing industry. Most of the previous cropping methods that do not need bounding box supervision rely on the sliding window mechanism. The sliding window method results in fixed aspect ratios and limits the shape of the cropping region. Moreover, the sliding window method usually produces lots of candidates on the input image, which is very time-consuming. Motivated by these challenges, we formulate image cropping as a sequential decision-making process and propose a reinforcement learning-based framework to address this problem, namely, Fast Aesthetics-Aware Adversarial Reinforcement Learning (Fast A3RL). Particularly, the proposed method develops an aesthetics-aware reward function that is dedicated for image cropping. Similar to human's decision-making process, we use a comprehensive state representation, including both the current observation and the historical experience. We train the agent using the actor-critic architecture in an end-to-end manner. The adversarial learning process is also applied during the training stage. The proposed method is evaluated on several popular cropping datasets, in which the images are unseen during training. The experiment results show that our method achieves the state-of-the-art performance with much fewer candidate windows and much less time compared with related methods.
机译:图像裁切旨在通过去除不需要的外部区域来提高图像质量,这在照相和打印行业已得到广泛使用。不需要边界框监视的大多数以前的裁剪方法都依赖于滑动窗口机制。滑动窗口方法导致固定的长宽比并限制裁剪区域的形状。而且,滑动窗口方法通常在输入图像上产生大量候选,这非常耗时。受这些挑战的推动,我们将图像裁剪公式化为顺序决策过程,并提出了一个基于强化学习的框架来解决此问题,即快速审美意识的对抗强化学习(Fast A3RL)。特别地,所提出的方法开发了专用于图像裁剪的审美意识奖励功能。与人类的决策过程类似,我们使用全面的状态表示形式,包括当前的观察和历史经验。我们以行为者批判的架构以端到端的方式训练代理。对抗性学习过程也将在训练阶段中应用。在几种流行的种植数据集上评估了所提出的方法,在训练过程中看不到图像。实验结果表明,与相关方法相比,我们的方法以更少的候选窗口和更少的时间实现了最新的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号