Learning dynamics in limited-control repeated games

Andrea Celli; Alberto Marchesi

首页> 外文期刊>Intelligenza Artificiale >Learning dynamics in limited-control repeated games

【24h】

Learning dynamics in limited-control repeated games

机译：在有限控制重复游戏中学习动态

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

In imperfect-information games, a common assumption is that players can perfectly model the strategic interaction and always maintain control over their decision points.We relax this assumption by introducing the notion of limited-control repeated games. In this setting, two players repeatedly play a zero-sum extensive-form game and, at each iteration, a player may lose control over portions of her game tree. Intuitively, this can be seen as the chance player hijacking the interaction and taking control of certain decision points. What subsequently happens is no longer controllable-or even known-by the original players. We introduce pruned fictitious play, a variation of fictitious play that can be employed by the players to reach an equilibrium in limited-control repeated games.We motivate this technique with the notion of limited best response, which is the key step of the learning rule we employ.We provide a general result on the probabilistic guarantees of a limited best response with respect to the original game model. Then, we experimentally evaluate our technique and show that pruned fictitious play has good convergence properties.

机译：在不完美的信息游戏中，共同的假设是玩家可以完全模拟战略互动，并始终保持对决策点的控制。我们通过引入有限控制重复游戏的概念来放松这种假设。在这个设置中，两个玩家反复发挥零和广泛的游戏，并且在每次迭代中，玩家可能会失去对她的游戏树的部分的控制。直观地，这可以被视为劫持互动和控制某些决策点的机会球员。随后发生的是什么不再可控制 - 甚至是原始玩家。我们介绍修剪修剪的虚构游戏，玩家可以用的虚构戏剧的变化，这些游戏在有限控制重复的游戏中达到均衡。我们激励了这种技术与有限的最佳反应的概念，这是学习规则的关键步骤我们雇佣了。我们提供了对原始游戏模型有限最佳响应的概率保证的一般结果。然后，我们通过实验评估我们的技术，并显示修剪修剪的虚构游戏具有良好的收敛性。

著录项

来源
《Intelligenza Artificiale》 |2018年第2期|共13页
作者
Andrea Celli; Alberto Marchesi;
展开▼
作者单位

Dipartimento di Elettronica Informazione e Bioingegneria Politecnico di Milano Piazza Leonardo da Vinci 32 Milano Italy;

Dipartimento di Elettronica Informazione e Bioingegneria Politecnico di Milano Piazza Leonardo da Vinci 32 Milano Italy;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类人工智能理论;
关键词
Repeated games; equilibrium computation; fictitious play; multi-agent learning; imperfect information;

机译：重复游戏;均衡计算;虚构的戏剧;多代理学习;不完美的信息;

相似文献

外文文献
中文文献
专利

1. Learning dynamics in limited-control repeated games [J] . Andrea Celli, Alberto Marchesi Intelligenza Artificiale . 2018,第2期

机译：在有限控制重复游戏中学习动态
2. Learning to compete, coordinate, and cooperate in repeated games using reinforcement learning [J] . Jacob W. Crandall, Michael A. Goodrich Machine Learning . 2011,第3期

机译：使用强化学习来学习在重复游戏中的竞争，协调和合作
3. A dynamic Cournot-Nash game: a representation of a finitely repeated feedback game [J] . Talat S. Genc Computational management science . 2007,第2期

机译：动态的古诺·纳什游戏：有限重复反馈游戏的表示
4. Modelling the Dynamics of Multiagent Q-Learning in Repeated Symmetric Games: a Mean Field Theoretic Approach [C] . Shuyue Hu, Chin-Wing Leung, Ho-fung Leung Conference on Neural Information Processing Systems . 2020

机译：模拟多读Q学习在重复对称游戏中的动态：平均现场理论方法
5. Multiagent social learning in large repeated games. [D] . Oh, Jean. 2009

机译：大型重复游戏中的多主体社交学习。
6. Learning with repeated-game strategies [O] . Christos A. Ioannou, Julian Romero 2014

机译：通过重复游戏策略学习
7. Characterizing the Dynamics of Learning in Repeated Reference Games [O] . Robert D. Hawkins, Michael C. Frank, Noah D. Goodman 2020

机译：在重复参考游戏中表征学习的动态

Learning dynamics in limited-control repeated games

摘要

著录项

相似文献

相关主题

期刊订阅