JMLR: Workshop and Conference Proceedings

Can Deep Reinforcement Learning Solve Erdos-Selfridge-Spencer Games?


Abstract

Deep reinforcement learning has achieved many recent successes, but our understanding of its strengths and limitations is hampered by the lack of rich environments in which we can fully characterize optimal behavior, and correspondingly diagnose individual actions against such a characterization. Here we consider a family of combinatorial games, arising from work of Erdos, Selfridge, and Spencer, and we propose their use as environments for evaluating and comparing different approaches to reinforcement learning. These games have a number of appealing features: they are challenging for current learning approaches, but they form (i) a low-dimensional, simply parametrized environment where (ii) there is a linear closed-form solution for optimal behavior from any state, and (iii) the difficulty of the game can be tuned by changing environment parameters in an interpretable way. We use these Erdos-Selfridge-Spencer games not only to compare different algorithms, but also to test for generalization, make comparisons to supervised learning, analyse multi-agent play, and even develop a self-play algorithm.
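The "linear closed-form solution" the abstract refers to is the classical potential-function argument for Erdos-Selfridge-Spencer attacker-defender games. A minimal sketch, assuming the usual conventions (pieces sit at integer levels, a piece reaching level 0 wins for the attacker, each turn the attacker partitions the pieces into two sets, the defender destroys one set, and survivors move down one level); the function names here are illustrative, not from the paper's code:

```python
def potential(levels):
    """Potential of a set of pieces: sum of 2^(-level) over the pieces.

    A piece at level l contributes 2^(-l), so a piece at level 0
    contributes 1 by itself -- total potential >= 1 signals danger
    for the defender.
    """
    return sum(2.0 ** -l for l in levels)

def defender_move(set_a, set_b):
    """Optimal defender play: destroy whichever set has higher potential.

    The surviving set then holds at most half the total potential, and
    moving its pieces down one level doubles each term, so the total
    potential never increases. If the game starts with potential < 1,
    it stays < 1, and no piece can ever reach level 0.
    """
    return "A" if potential(set_a) >= potential(set_b) else "B"

# Two pieces at level 1 have potential 0.5 + 0.5 = 1.0.
print(potential([1, 1]))
# One piece at level 1 (potential 0.5) outweighs two at level 3
# (potential 0.25), so the defender destroys set A.
print(defender_move([1], [3, 3]))
```

Because the optimal policy is this simple linear function of the state, every action an RL agent takes in these games can be scored exactly against optimal play, which is what makes the environment family diagnostic.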

