【24h】

Task Planning in 'Block World' with Deep Reinforcement Learning

机译:“街区世界”任务规划,深增强学习

获取原文

摘要

At the moment reinforcement learning have advanced significantly with discovering new techniques and instruments for training. This paper is devoted to the application convolutional and recurrent neural networks in the task of planning with reinforcement learning problem. The aim of the work is to check whether the neural networks are fit for this problem. During the experiments in a block environment the task was to move blocks to obtain the final arrangement which was the target. Significant part of the problem is connected with the determining on the reward function and how the results are depending in reward's calculation. The current results show that without modifying the initial problem into more straightforward ones neural networks didn't demonstrate stable learning process. In the paper a modified reward function with sub-targets and euclidian reward calculation was used for more precise reward determination. Results have shown that none of the tested architectures were not able to achieve goal.
机译:目前强化学习与发现新的技术和仪器培训显著进展。本文致力于在强化学习问题规划任务应用卷积和复发性神经网络。这项工作的目的是检查神经网络是否适合这个问题。期间在块环境的实验的任务是移动块,以获得最终布置,其是目标。问题的显著一部分与所述回报函数的确定和如何结果取决于在奖励的计算连接。目前的结果显示,在不修改初始问题转化为更直接了神经网络并没有表现出稳定的学习过程。在纸被用于更精确地确定奖励与子目标和欧几里德计算报酬的改性回报函数。结果显示,没有一个测试架构无法实现的目标。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号