Multi-agent reinforcement learning with approximate model learning for competitive games

Abstract

We propose a method for learning multi-agent policies to compete against multiple opponents. The method consists of recurrent neural network-based actor-critic networks and deterministic policy gradients that promote cooperation between agents through communication. The learning process does not require access to the opponents' parameters or observations, because the agents are trained separately from the opponents. The actor networks enable the agents to communicate via forward and backward paths, while the critic network helps train the actors by delivering gradient signals based on each actor's contribution to the global reward. Moreover, to address the nonstationarity caused by the evolution of the other agents, we propose approximate model learning that uses auxiliary prediction networks to model the state transitions, the reward function, and opponent behavior. In the test phase, we use competitive multi-agent environments to demonstrate, by comparison, the usefulness and superiority of the proposed method in terms of learning efficiency and goal achievement. The comparison results show that the proposed method outperforms the alternatives.
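
The architecture described above can be made concrete with a small sketch. The following is a minimal, illustrative PyTorch implementation, not the authors' code: recurrent actors that pass messages along a forward chain, and a centralized critic whose gradient flows back through both the actions and the communication channel, which is one plausible reading of the forward and backward communication paths. All dimensions, class names, and the single-message-chain topology are assumptions for illustration.

    import torch
    import torch.nn as nn

    OBS_DIM, MSG_DIM, ACT_DIM, HID_DIM, N_AGENTS = 16, 8, 4, 32, 3

    class RecurrentActor(nn.Module):
        """One agent's policy: a GRU over its own observation and the incoming message."""
        def __init__(self):
            super().__init__()
            self.gru = nn.GRUCell(OBS_DIM + MSG_DIM, HID_DIM)
            self.action_head = nn.Linear(HID_DIM, ACT_DIM)   # deterministic action
            self.message_head = nn.Linear(HID_DIM, MSG_DIM)  # outgoing message

        def forward(self, obs, msg_in, h):
            h = self.gru(torch.cat([obs, msg_in], dim=-1), h)
            return torch.tanh(self.action_head(h)), torch.tanh(self.message_head(h)), h

    class CentralizedCritic(nn.Module):
        """Q(o_1..o_N, a_1..a_N): sees all observations and actions during training only."""
        def __init__(self):
            super().__init__()
            self.net = nn.Sequential(
                nn.Linear(N_AGENTS * (OBS_DIM + ACT_DIM), HID_DIM),
                nn.ReLU(),
                nn.Linear(HID_DIM, 1),
            )

        def forward(self, all_obs, all_act):
            return self.net(torch.cat([all_obs, all_act], dim=-1))

    actors = [RecurrentActor() for _ in range(N_AGENTS)]
    critic = CentralizedCritic()
    obs = torch.randn(1, N_AGENTS, OBS_DIM)
    hidden = [torch.zeros(1, HID_DIM) for _ in range(N_AGENTS)]
    msg = torch.zeros(1, MSG_DIM)  # the first agent receives a zero message
    actions = []
    for i, actor in enumerate(actors):
        act, msg, hidden[i] = actor(obs[:, i], msg, hidden[i])
        actions.append(act)
    q = critic(obs.flatten(1), torch.cat(actions, dim=-1))
    actor_loss = -q.mean()  # deterministic policy gradient: ascend Q w.r.t. actions
    actor_loss.backward()   # gradients reach every actor through actions and messages

Because each message is produced by a differentiable head, the critic's gradient signal propagates backward through the message chain, so every agent is trained according to its contribution to the shared Q-value.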

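The approximate model learning can be sketched in the same spirit: auxiliary prediction heads on a shared trunk estimate the next observation, the reward, and the opponent's action from the current observation and the agent's own action, trained by regression on replayed transitions. Again, every name and dimension below is an assumption for illustration, not the paper's exact design.

    import torch
    import torch.nn as nn
    import torch.nn.functional as F

    OBS_DIM, ACT_DIM, OPP_ACT_DIM, HID_DIM = 16, 4, 4, 32

    class ApproxModel(nn.Module):
        """Auxiliary prediction networks: transition, reward, and opponent models."""
        def __init__(self):
            super().__init__()
            self.trunk = nn.Sequential(nn.Linear(OBS_DIM + ACT_DIM, HID_DIM), nn.ReLU())
            self.next_obs_head = nn.Linear(HID_DIM, OBS_DIM)      # state transitions
            self.reward_head = nn.Linear(HID_DIM, 1)              # reward function
            self.opponent_head = nn.Linear(HID_DIM, OPP_ACT_DIM)  # opponent behavior

        def forward(self, obs, act):
            z = self.trunk(torch.cat([obs, act], dim=-1))
            return self.next_obs_head(z), self.reward_head(z), self.opponent_head(z)

    # Regression losses on a (hypothetical) batch of replayed transitions.
    model = ApproxModel()
    obs, act = torch.randn(64, OBS_DIM), torch.randn(64, ACT_DIM)
    next_obs, reward = torch.randn(64, OBS_DIM), torch.randn(64, 1)
    opp_act = torch.randn(64, OPP_ACT_DIM)
    pred_obs, pred_rew, pred_opp = model(obs, act)
    aux_loss = (F.mse_loss(pred_obs, next_obs)
                + F.mse_loss(pred_rew, reward)
                + F.mse_loss(pred_opp, opp_act))
    aux_loss.backward()  # the shared trunk learns features that track the environment

Predicting opponent behavior from the agent's own observations is one way to counter the nonstationarity the abstract mentions: as the opponents evolve, the auxiliary losses push the shared representation to keep up without requiring access to the opponents' parameters.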