...
首页> 外文期刊>Intelligenza Artificiale >Learning dynamics in limited-control repeated games
【24h】

Learning dynamics in limited-control repeated games

机译:在有限控制重复游戏中学习动态

获取原文
获取原文并翻译 | 示例
           

摘要

In imperfect-information games, a common assumption is that players can perfectly model the strategic interaction and always maintain control over their decision points.We relax this assumption by introducing the notion of limited-control repeated games. In this setting, two players repeatedly play a zero-sum extensive-form game and, at each iteration, a player may lose control over portions of her game tree. Intuitively, this can be seen as the chance player hijacking the interaction and taking control of certain decision points. What subsequently happens is no longer controllable-or even known-by the original players. We introduce pruned fictitious play, a variation of fictitious play that can be employed by the players to reach an equilibrium in limited-control repeated games.We motivate this technique with the notion of limited best response, which is the key step of the learning rule we employ.We provide a general result on the probabilistic guarantees of a limited best response with respect to the original game model. Then, we experimentally evaluate our technique and show that pruned fictitious play has good convergence properties.
机译:在不完美的信息游戏中,共同的假设是玩家可以完全模拟战略互动,并始终保持对决策点的控制。我们通过引入有限控制重复游戏的概念来放松这种假设。在这个设置中,两个玩家反复发挥零和广泛的游戏,并且在每次迭代中,玩家可能会失去对她的游戏树的部分的控制。直观地,这可以被视为劫持互动和控制某些决策点的机会球员。随后发生的是什么不再可控制 - 甚至是原始玩家。我们介绍修剪修剪的虚构游戏,玩家可以用的虚构戏剧的变化,这些游戏在有限控制重复的游戏中达到均衡。我们激励了这种技术与有限的最佳反应的概念,这是学习规则的关键步骤我们雇佣了。我们提供了对原始游戏模型有限最佳响应的概率保证的一般结果。然后,我们通过实验评估我们的技术,并显示修剪修剪的虚构游戏具有良好的收敛性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号