A multi-agent reinforcement learning method with learning of other agents for competitive game

Yoichiro Matsuno; Tatsuya Yamazaki; Jun Matsuda; Shin Ishii

首页> 外文期刊>電子情報通信学会技術研究報告. ニュ-ロコンピュ-ティング. Neurocomputing >A multi-agent reinforcement learning method with learning of other agents for competitive game

【24h】

A multi-agent reinforcement learning method with learning of other agents for competitive game

机译：一种多智能体强化学习方法，结合其他智能体进行竞技游戏学习

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

This report proposes a reinforcement learning (RL) method based on the Actor-Critic architecture, which can be applied to partially-observable multi-agent competitive games. As an example, we consider a card game "Hearts". The RL then becomes a partially-observable Markov decision process (POMDP). In our method, a single Hearts game is divided into three stages, and three actors are prepared so that one of them plays and learns separately in each stage. In particular, the actor for the middle stage plays so as to enlarge the expected temporal-difference error, which is calculated using the evaluation function approximated by the critic and the estimated state transition. Computer experiments with heuristic players show that our RL method works well.

机译：本报告提出了一种基于Actor-Critic体系结构的强化学习（RL）方法，该方法可应用于部分可观察到的多主体竞争性游戏。例如，我们考虑一个纸牌游戏“心”。 RL随后成为部分可观察到的马尔可夫决策过程（POMDP）。在我们的方法中，将单个Hearts游戏划分为三个阶段，并准备了三个演员，以便其中一个在每个阶段分别扮演和学习。特别地，用于中间阶段的演员进行表演以扩大预期的时差误差，该误差是使用评论者近似的评估函数和估计的状态转换来计算的。启发式播放器的计算机实验表明，我们的RL方法效果很好。

著录项

来源
《電子情報通信学会技術研究報告. ニュ-ロコンピュ-ティング. Neurocomputing》 |2000年第688期|共8页
作者
Yoichiro Matsuno; Tatsuya Yamazaki; Jun Matsuda; Shin Ishii;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 jpn
中图分类人工智能理论;
关键词
Multi-agent; Reinforcement learning; Competitive game; Actor-critic model; Opponentagent model inference;

机译：多主体强化学习竞争博弈行为批评模型主体模型推理;

相似文献

外文文献
中文文献
专利

1. A multi-agent reinforcement learning method with learning of other agents for competitive game [J] . Yoichiro Matsuno, Tatsuya Yamazaki, Jun Matsuda, 電子情報通信学会技術研究報告. ニュ-ロコンピュ-ティング. Neurocomputing . 2000,第688期

机译：一种多智能体强化学习方法，结合其他智能体进行竞技游戏学习
2. A multi-agent reinforcement learning method with learning of other agents for competitive game [J] . Yoichiro Matsuno, Tatsuya Yamazaki, Jun Matsuda, 電子情報通信学会技術研究報告. ニュ-ロコンピュ-ティング. Neurocomputing . 2000,第688期

机译：一种多智能体增强学习方法，具有竞争游戏的其他代理商
3. Multi-agent reinforcement learning with approximate model learning for competitive games [J] . Young Joon Park, Yoon Sang Cho, Seoung Bum Kim PLoS One . 2019,第9期

机译：竞争游戏近似模型学习多功能辅助加固学习
4. A Multi-Agent Reinforcement Learning Method for a Partially-Observable Competitive Game [C] . Yoichiro Matsuno, Tatsuya Ymazaki, Shin Ishii 5th International Conference on Autonomous Agents, 5th, May 28 - Jun 1, 2001, Montreal, Canada . 2001

机译：部分可观察竞争游戏的多智能体强化学习方法
5. Large-Scale Multi-Agent Decision-Making Using Mean Field Game Theory and Reinforcement Learning [D] . Zhou, Zejian. 2021

机译：使用均值野外博弈论和强化学习的大规模多代理决策
6. Multi-agent reinforcement learning with approximate model learning for competitive games [O] . Young Joon Park, Yoon Sang Cho, Seoung Bum Kim 2012

机译：多主体强化学习和近似模型学习的竞技游戏
7. Multi-agent reinforcement learning with approximate model learning for competitive games [O] . Young Joon Park, Yoon Sang Cho, Seoung Bum Kim 2019

机译：具有竞争游戏的近似模型学习的多智能体增强学习

A multi-agent reinforcement learning method with learning of other agents for competitive game

摘要

著录项

相似文献

相关主题

期刊订阅