Fast A3RL: Aesthetics-Aware Adversarial Reinforcement Learning for Image Cropping

Li Debang; Wu Huikai; Zhang Junge; Huang Kaiqi

Abstract

Image cropping aims to improve image quality by removing unwanted outer areas, and it is widely used in the photography and printing industries. Most previous cropping methods that do not need bounding box supervision rely on a sliding window mechanism. The sliding window method results in fixed aspect ratios and limits the shape of the cropping region. Moreover, it usually produces a large number of candidates on the input image, which is very time-consuming. Motivated by these challenges, we formulate image cropping as a sequential decision-making process and propose a reinforcement learning-based framework to address this problem, namely Fast Aesthetics-Aware Adversarial Reinforcement Learning (Fast A3RL). In particular, the proposed method develops an aesthetics-aware reward function dedicated to image cropping. Similar to the human decision-making process, we use a comprehensive state representation that includes both the current observation and the historical experience. We train the agent with the actor-critic architecture in an end-to-end manner. An adversarial learning process is also applied during the training stage. The proposed method is evaluated on several popular cropping datasets whose images are unseen during training. The experimental results show that our method achieves state-of-the-art performance with far fewer candidate windows and much less time than related methods.
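The sequential formulation in the abstract can be illustrated with a toy sketch. The abstract does not specify the action set, the reward, or the network, so everything below is an assumption for illustration only: the `Box` type, the four shrink actions plus `stop`, the reward defined as the change in a score, the `toy_score` function standing in for a learned aesthetics scorer, and the `greedy_policy` standing in for the trained actor-critic agent.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass(frozen=True)
class Box:
    """Crop window in pixel coordinates (hypothetical representation)."""
    x0: int
    y0: int
    x1: int
    y1: int


# Hypothetical action set: shrink one edge by a fixed step, or stop.
ACTIONS = ("shrink_left", "shrink_top", "shrink_right", "shrink_bottom", "stop")


def apply_action(box: Box, action: str, step: int = 10, min_size: int = 20) -> Box:
    """Return the window after one action; moves that make it too small are ignored."""
    x0, y0, x1, y1 = box.x0, box.y0, box.x1, box.y1
    if action == "shrink_left":
        x0 += step
    elif action == "shrink_top":
        y0 += step
    elif action == "shrink_right":
        x1 -= step
    elif action == "shrink_bottom":
        y1 -= step
    if x1 - x0 < min_size or y1 - y0 < min_size:
        return box
    return Box(x0, y0, x1, y1)


def run_episode(
    policy: Callable[[Box, Tuple[Box, ...]], str],
    score_fn: Callable[[Box], float],
    start: Box,
    max_steps: int = 20,
) -> Tuple[Box, List[float]]:
    """Adjust the window step by step until the policy stops.

    The policy sees the current window and the history of earlier windows,
    mirroring the "current observation plus historical experience" state.
    The per-step reward is assumed to be the change in score.
    """
    box, history, rewards = start, [], []
    for _ in range(max_steps):
        action = policy(box, tuple(history))
        if action == "stop":
            break
        new_box = apply_action(box, action)
        rewards.append(score_fn(new_box) - score_fn(box))
        history.append(box)
        box = new_box
    return box, rewards


def toy_score(box: Box) -> float:
    """Stand-in for an aesthetics scorer: peaks at an arbitrary window."""
    return -(abs(box.x0 - 20) + abs(box.y0 - 10) + abs(box.x1 - 90) + abs(box.y1 - 80))


def greedy_policy(box: Box, history: Tuple[Box, ...]) -> str:
    """Stand-in for a learned policy: take the best one-step move, else stop."""
    best = max(ACTIONS[:-1], key=lambda a: toy_score(apply_action(box, a)))
    return best if toy_score(apply_action(box, best)) > toy_score(box) else "stop"


final_box, rewards = run_episode(greedy_policy, toy_score, Box(0, 0, 100, 100))
```

In this toy run the agent scores only the windows along a single trajectory and its one-step neighbours, rather than a dense grid of sliding-window candidates, which is the efficiency argument the abstract makes. A trained agent would replace both stand-in functions.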


Bibliographic details

Source
IEEE Transactions on Image Processing, 2019, no. 10, pp. 5105-5120 (16 pages)
Authors
Li Debang; Wu Huikai; Zhang Junge; Huang Kaiqi
Author affiliations

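Institute of Automation, Chinese Academy of Sciences, Beijing 100190, People's Republic of China; School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing 100049, People's Republic of China

Institute of Automation, Chinese Academy of Sciences, Beijing 100190, People's Republic of China; School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing 100049, People's Republic of China; CAS Center for Excellence in Brain Science and Intelligence Technology, Beijing 100190, People's Republic of China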

Original format: PDF
Language: English
Keywords
Reinforcement learning; adversarial learning; image cropping

