...
首页> 外文期刊>IEEE Transactions on Games >Crawling in Rogue's Dungeons With Deep Reinforcement Techniques
【24h】

Crawling in Rogue's Dungeons With Deep Reinforcement Techniques

机译:爬行罗伊的地下野,深入加固技术

获取原文
获取原文并翻译 | 示例
           

摘要

This paper is a report of our extensive experimentation, during the last two years, of deep reinforcement techniques for training an agent to move in the dungeons of the famous Rogue video game. The challenging nature of the problem is tightly related to the procedural, random generation of new dungeon maps at each level, which forbids any form of level-specific learning and forces us to address the navigation problem in its full generality. Other interesting aspects of the game from the point of view of automatic learning are the partially observable nature of the problem since maps are initially not visible and get discovered during exploration, and the problem of sparse rewards, requiring the acquisition of complex, nonreactive behaviors involving memory and planning. In this paper, we develop on previous works to make a more systematic comparison of different learning techniques, focusing in particular on Asynchronous Advantage Actor-Critic and Actor-Critic with Experience Replay (ACER). In a game like Rogue, sparsity of rewards is mitigated by the variability of the dungeon configurations (sometimes, by luck, exit is at hand); if this variability can be tamed-as ACER, better than other algorithms, seems able to do-the problem of sparse rewards can be overcome without any need of intrinsic motivations.
机译:本文是我们广泛的实验,在过去两年中,培训代理人在着名的流氓视频游戏的地下城的深度加固技术中报告。问题的挑战性质与每个级别的程序,随机生成的新地牢映射紧密相关,这禁止任何形式的级别专门的学习,并迫使我们在其全部普遍地解决导航问题。从自动学习的角度来看,游戏的其他有趣方面是问题的部分可观察性质,因为地图最初不可见并在探索期间被发现,以及稀疏奖励的问题,需要获取复杂的,不反应行为涉及记忆和规划。在本文中,我们开发了以前的作品,以便更系统的比较不同的学习技术,特别是对异步优势演员 - 评论家和演员 - 评论家与经验重播(宏碁)。在像流氓这样的游戏中,通过地牢配置的可变性减轻了奖励的稀疏性(有时,运气,退出是手头的);如果这种可变性可以驯服 - 作为宏碁,比其他算法更好,似乎能够做出稀疏奖励的问题,而无需任何内在动机。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号