IEICE Transactions on Information and Systems

Model-Based Reinforcement Learning in Multiagent Systems with Sequential Action Selection



Abstract

Model-based reinforcement learning uses the information gathered during each experience more efficiently than model-free reinforcement learning. This is especially attractive in multiagent systems, where a large number of experiences is necessary to achieve good performance. In this paper, model-based reinforcement learning is developed for a group of self-interested agents with sequential action selection, based on traditional prioritized sweeping. Each decision-making situation in this learning process, called an extensive Markov game, is modeled as an n-person general-sum extensive-form game with perfect information. A modified version of backward induction is proposed for action selection, which adjusts the tradeoff between selecting subgame perfect equilibrium points, as the optimal joint actions, and learning new joint actions. The algorithm is proved to be convergent and is discussed in light of new results on the convergence of traditional prioritized sweeping.
机译:与无模型的强化学习相比,基于模型的强化学习在每次体验期间更有效地使用所收集的信息。在多代理系统中,这尤其有趣,因为要获得良好的性能,需要大量的经验。在本文中,基于模型的强化学习是针对一组自私的代理开发的,这些代理具有基于传统优先清扫的顺序动作选择。在这种学习过程中,决策的每一个情况,称为广泛马尔可夫博弈,都被建模为具有完善信息的n人一般和广义博弈。提出了一种改进的后向归纳法用于动作选择,它可以调整选择子游戏的完美平衡点(作为最佳联合动作)与学习新的联合动作之间的权衡。在传统优先扫描的收敛性的新结果的基础上,证明了该算法是收敛的并进行了讨论。
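Backward induction on an extensive-form game with perfect information can be sketched as below. The two-player game tree and the `epsilon` exploration knob are illustrative assumptions: `epsilon` merely stands in for the kind of exploit-versus-explore tradeoff the abstract's modified backward induction adjusts.

```python
import random

# Hypothetical two-player extensive-form game with perfect information.
# Internal node: (player_index, {action: child}); leaf: ("leaf", payoffs).
GAME = (0, {
    "L": (1, {"l": ("leaf", (3, 1)), "r": ("leaf", (0, 0))}),
    "R": (1, {"l": ("leaf", (2, 2)), "r": ("leaf", (1, 3))}),
})

def backward_induction(node, epsilon=0.0, rng=random):
    """Return (payoff_vector, chosen_action) for this subtree.

    With epsilon == 0 this selects the subgame perfect equilibrium.
    With probability epsilon the acting agent instead tries a random
    action, trading equilibrium play for learning new joint actions.
    """
    if node[0] == "leaf":
        return node[1], None
    player, children = node
    # Solve every subgame first, then let the acting player choose.
    payoffs = {a: backward_induction(c, epsilon, rng)[0]
               for a, c in children.items()}
    best = max(payoffs, key=lambda a: payoffs[a][player])
    if epsilon > 0 and rng.random() < epsilon:
        best = rng.choice(list(payoffs))  # explore a non-equilibrium action
    return payoffs[best], best
```

In this toy tree, player 1 would answer "L" with "l" (payoff 1 beats 0) and "R" with "r" (3 beats 2), so player 0's subgame perfect choice at the root is "L", yielding payoffs (3, 1).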
