【24h】

Learning Mutual Trust

机译:学习互信

获取原文

摘要

The multiagent learning literature has looked at iterated two-player games to develop mechanisms that allow agents to learn to converge on Nash Equilibrium strategy profiles. An equilibrium configuration implies that there is no motivation for one player to change its strategy if the other does not. Often, in general sum games, a higher payoff can be obtained by both players if one chooses not to respond optimally to the other player. By developing mutual trust, agents can avoid iterated best responses that will lead to a lesser payoff Nash Equilibrium. In this paper we consider 1-level agents (modelers) who select actions based on expected utility considering probability distributions over the actions of the opponent(s). We show that in certain situations, such stochastically-greedy agents can perform better (by developing mutually trusting behavior) than those that explicitly attempt to converge to Nash Equilibrium. We also experiment with an interesting action revelation strategy that can give the revealer better payoff on convergence than a non-revealing approach. By revealing, the revealer enables the opponent to agree to a more trusted equilibrium.
机译:Multi8gent学习文献研究了迭代双人游戏,以制定允许代理学习纳入纳什均衡战略概况的机制。均衡配置意味着一个玩家如果另一个球员没有改变其策略的动机。通常,在一般和游戏中,如果选择不对其他玩家最佳地响应,则可以通过两个玩家获得更高的收益。通过发展互相信任,代理商可以避免迭代最佳响应,这将导致纳什均衡较小的达努利。在本文中,我们考虑基于预期效用选择动作的1级代理(建模者)考虑到对手的行动概率分布。我们表明,在某些情况下,这种随机贪婪的代理商可以更好地执行(通过开发互相信任行为)而不是明确地汇集到纳什均衡的那些。我们还尝试了一个有趣的行动启示策略,可以使揭示者更好地收敛于不泄露的方法。通过揭示,揭示者使对手能够同意更受信任的均衡。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号