Published in: International Conference on Digitization

Enhanced Reinforcement Learning with Targeted Dropout

Abstract

In recent years, research on Reinforcement Learning (RL) has been driven by Deep Q-Network (DQN) optimization for learning prediction and control in Markov decision processes (MDPs). This paper applies the Targeted Dropout strategy to the DQN used in RL; the strategy is built directly into learning and is needed to deal with MDPs that have huge or continuous state and action spaces. At every weight/unit update, targeted dropout selects a candidate set of elements, retaining the weights/units of largest magnitude, and then applies dropout to that candidate set. Common pruning strategies likewise focus on fast approximations, such as removing the weights with the smallest magnitude, or ranking the weights/units by the sensitivity of the network, or of task performance, with respect to them and removing the least sensitive ones. The results show that the proposed algorithm for enhancing the RL DQN is more accurate in finding the best action for achieving maximum reward. In simulation, it reaches the maximum average reward within a small number of episodes, whereas without Targeted Dropout more episodes are needed to reach that average reward. Throughout the evaluation, the proposed algorithm learns more effectively to find large reward values.
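
The selection-then-dropout step described in the abstract can be illustrated with a minimal NumPy sketch. This is not the paper's implementation: the function name `targeted_weight_dropout`, the parameter names `gamma` (fraction of lowest-magnitude weights treated as drop candidates) and `alpha` (drop probability within the candidate set), and the choice to rank over the whole matrix rather than per unit are all assumptions made for illustration.

```python
import numpy as np


def targeted_weight_dropout(weights, gamma=0.5, alpha=0.5, rng=None):
    """Zero out a random subset of the lowest-magnitude weights.

    gamma and alpha are hypothetical parameter names for this sketch:
    gamma is the fraction of weights eligible for dropout (the smallest
    by absolute value), alpha is the probability of dropping each one.
    """
    rng = np.random.default_rng() if rng is None else rng
    w = np.asarray(weights, dtype=float)

    n_candidates = int(gamma * w.size)
    if n_candidates == 0:
        return w.copy()

    # Indices of the n_candidates smallest-magnitude weights.
    magnitudes = np.abs(w).ravel()
    candidate_idx = np.argpartition(magnitudes, n_candidates - 1)[:n_candidates]

    candidate_mask = np.zeros(w.size, dtype=bool)
    candidate_mask[candidate_idx] = True

    # Dropout is applied only inside the candidate set; the
    # largest-magnitude weights are always kept.
    drop_mask = candidate_mask & (rng.random(w.size) < alpha)

    out = w.ravel().copy()
    out[drop_mask] = 0.0
    return out.reshape(w.shape)


# Usage: with alpha=1.0 every candidate is dropped, so only the two
# largest-magnitude weights (-2.0 and 3.0) survive.
w = np.array([[0.1, -2.0], [3.0, -0.2]])
pruned = targeted_weight_dropout(w, gamma=0.5, alpha=1.0)
```

In a DQN setting, a mask like this would presumably be applied to the Q-network's layer weights during each training update and omitted at evaluation time; the abstract does not state how the authors integrate it, so that part is left out of the sketch.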