Journal: IEEE Transactions on Games

Generating and Adapting to Diverse Ad Hoc Partners in Hanabi

Abstract

Hanabi is a cooperative game that brings the problem of modeling other players to the forefront. In this game, coordinated groups of players can leverage preestablished conventions to great effect. In this article, we focus on ad hoc settings with no previous coordination between partners. We introduce a “Bayesian Meta-Agent” that maintains a belief distribution over hypotheses of partner policies. The policies that serve as initial hypotheses are generated using MAP-Elites, to ensure behavioral diversity. We evaluate an “Adaptive” version of the agent, which selects a response policy based on the updated belief distribution, and a “Generalist” version, which selects a response based on the uniform prior. In short episodes of ten games with a consistent partner, the “Adaptive” version outperforms the “Generalist” when the training and evaluation populations are the same. This presents a first step toward an agent that can model its partner and adapt within a time frame that is compatible with human interaction.
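
The belief-update idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `update_belief` and `select_response`, the payoff table, and the likelihood values are all hypothetical, and the sketch assumes that each partner hypothesis assigns a probability to the partner's observed action and that an expected score is known for every pairing of response policy and hypothesis.

```python
def update_belief(belief, likelihoods):
    """Bayes' rule: posterior is proportional to prior times likelihood.

    belief[j]      -- current probability of partner hypothesis j
    likelihoods[j] -- probability hypothesis j assigns to the observed action
    """
    unnormalized = [b * l for b, l in zip(belief, likelihoods)]
    total = sum(unnormalized)
    if total == 0.0:
        # Observation impossible under every hypothesis: keep the old belief.
        return list(belief)
    return [u / total for u in unnormalized]


def select_response(belief, payoff):
    """Return the index of the response with the highest expected score.

    payoff[i][j] -- assumed expected score of response i with hypothesis j
    """
    expected = [sum(p * b for p, b in zip(row, belief)) for row in payoff]
    return max(range(len(expected)), key=expected.__getitem__)


# Hypothetical payoff table: 3 response policies x 3 partner hypotheses.
payoff = [
    [20.0, 5.0, 10.0],   # specialist response for hypothesis 0
    [8.0, 18.0, 9.0],    # specialist response for hypothesis 1
    [13.0, 13.0, 13.0],  # response that is decent against everyone
]

prior = [1.0 / 3.0] * 3

# "Generalist": choose once, using only the uniform prior.
generalist_choice = select_response(prior, payoff)

# "Adaptive": update the belief after each observed partner action.
belief = list(prior)
for likelihoods in [[0.7, 0.2, 0.1]] * 3:  # made-up observation likelihoods
    belief = update_belief(belief, likelihoods)
adaptive_choice = select_response(belief, payoff)

print(generalist_choice, adaptive_choice, belief)
```

With these made-up numbers, the uniform prior favors the all-round response, while three observations that are most likely under hypothesis 0 shift the belief enough to favor that hypothesis's specialist response.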
