Generating and Adapting to Diverse Ad Hoc Partners in Hanabi

Rodrigo Canaan; Xianbo Gao; Julian TogeliusAndy NealenStefan Menzel

首页> 外文期刊>IEEE transactions on GAMES >Generating and Adapting to Diverse Ad Hoc Partners in Hanabi

【24h】

Generating and Adapting to Diverse Ad Hoc Partners in Hanabi

机译：Generating and Adapting to Diverse Ad Hoc Partners in Hanabi

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相关主题

摘要

Hanabi is a cooperative game that brings the problem of modeling other players to the forefront. In this game, coordinated groups of players can leverage preestablished conventions to great effect. In this article, we focus on ad hoc settings with no previous coordination between partners. We introduce a #x201C;Bayesian Meta-Agent#x201D; that maintains a belief distribution over hypotheses of partner policies. The policies that serve as initial hypotheses are generated using MAP-Elites, to ensure behavioral diversity. We evaluate an #x201C;Adaptive#x201D; version of the agent, which selects a response policy based on the updated belief distribution and a #x201C;Generalist#x201D; version, which selects a response based on the uniform prior. In short episodes of ten games with a consistent partner, the #x201C;Adaptive#x201D; version outperforms the #x201C;Generalist#x201D; when the training and evaluation populations are the same. This presents a first step toward an agent that can model its partner and adapt within a time frame that is compatible with human interaction.

著录项

来源
《IEEE transactions on GAMES》 |2023年第2期|228-241|共14页
作者
Rodrigo Canaan; Xianbo Gao; Julian TogeliusAndy NealenStefan Menzel;
展开▼
作者单位

Cal Poly State University;

NYU Tandon School of Engineering;

University of Southern CaliforniaHonda Research Institute Europe GmbH;

展开▼
收录信息
原文格式 PDF
正文语种英语
中图分类电工技术;
关键词
Learning (artificial intelligence) - Naive Bayes methods; Computational and artificial intelligence - Evolutionary computation; Games; Color; Training; Adaptive systems; Artificial intelligence; Statistics; Sociology;

Generating and Adapting to Diverse Ad Hoc Partners in Hanabi

摘要

著录项

相关主题

期刊订阅