Conference: International Joint Conference on Neural Networks

Strategy Selection in Complex Game Environments Based on Transfer Reinforcement Learning



Abstract

Using previously acquired knowledge to accelerate learning on a new task remains a challenge in many fields of industrial engineering and science. In this paper, we propose a transfer reinforcement learning model with knowledge Inheritance and decision-making Assistance (trIA). In the knowledge inheritance stage, trIA employs a simultaneous multi-task and multi-instance learning strategy to compress the knowledge acquired by experts on distinct tasks into a global multi-task agent. In the decision-making assistance stage, trIA adopts a dual-column progressive neural network framework to fully utilize both the prior knowledge in the global multi-task agent and the knowledge acquired in the new task. Experimental results on the Atari domain demonstrate that the proposed knowledge inheritance model can perform at nearly the same level as the experts in the distinct source task environments. The results also demonstrate that the decision-making assistance model transfers knowledge from the source tasks to the target tasks effectively. Moreover, comparisons with state-of-the-art algorithms validate the effectiveness of the proposed trIA for strategy selection in complex game environments.
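To make the dual-column idea concrete, the following is a minimal sketch of a two-column progressive network forward pass, not the authors' implementation. It assumes a frozen first column standing in for the global multi-task agent, a trainable second column for the new task, and a single lateral connection feeding the frozen column's hidden features into the new column's output layer. The class name `DualColumnPolicy`, the layer sizes, and the random weights are all hypothetical; the abstract does not specify any of them.

```python
import math
import random


def linear(weights, bias, x):
    """Dense layer; `weights` holds one row per output unit."""
    return [sum(w * xi for w, xi in zip(row, x)) + b
            for row, b in zip(weights, bias)]


def relu(v):
    return [max(0.0, a) for a in v]


def softmax(v):
    m = max(v)  # subtract the max for numerical stability
    exps = [math.exp(a - m) for a in v]
    total = sum(exps)
    return [e / total for e in exps]


def init(rows, cols, rng):
    """Small random weight matrix (placeholder for learned weights)."""
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]


class DualColumnPolicy:
    """Hypothetical two-column progressive network (illustrative only)."""

    def __init__(self, n_obs, n_hidden, n_actions, seed=0):
        rng = random.Random(seed)
        # Column 1: assumed to be pretrained on source tasks and kept frozen.
        self.src_w1 = init(n_hidden, n_obs, rng)
        self.src_b1 = [0.0] * n_hidden
        # Column 2: would be trained on the target task.
        self.tgt_w1 = init(n_hidden, n_obs, rng)
        self.tgt_b1 = [0.0] * n_hidden
        self.tgt_w2 = init(n_actions, n_hidden, rng)
        self.tgt_b2 = [0.0] * n_actions
        # Lateral adapter: maps frozen-column features into column 2's output.
        self.lat_w = init(n_actions, n_hidden, rng)

    def forward(self, obs):
        h_src = relu(linear(self.src_w1, self.src_b1, obs))  # frozen features
        h_tgt = relu(linear(self.tgt_w1, self.tgt_b1, obs))  # new-task features
        own = linear(self.tgt_w2, self.tgt_b2, h_tgt)
        lateral = linear(self.lat_w, [0.0] * len(own), h_src)
        logits = [a + b for a, b in zip(own, lateral)]
        return softmax(logits)  # action probabilities for the target task


policy = DualColumnPolicy(n_obs=4, n_hidden=8, n_actions=3)
probs = policy.forward([0.1, -0.2, 0.3, 0.0])
```

In a sketch like this, only the second column and the lateral adapter would receive gradient updates during target-task training, so the source knowledge in the first column is reused without being overwritten.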
