首页> 外文会议>AAAI Conference on Artificial Intelligence >Path Planning Problems with Side Observations - When Colonels Play Hide-and-Seek
【24h】

Path Planning Problems with Side Observations - When Colonels Play Hide-and-Seek

机译:侧视图的路径规划问题 - 当上校播放捉迷藏时

获取原文

摘要

Resource allocation games such as the famous Colonel Blotto (CB) and Hide-and-Seek (HS) games are often used to model a large variety of practical problems, but only in their one-shot versions. Indeed, due to their extremely large strategy space, it remains an open question how one can efficiently learn in these games. In this work, we show that the online CB and HS games can be cast as path planning problems with side-observations (SOPPP): at each stage, a learner chooses a path on a directed acyclic graph and suffers the sum of losses that are adversarially assigned to the corresponding edges; and she then receives semi-bandit feedback with side-observations (i.e., she observes the losses on the chosen edges plus some others). We propose a novel algorithm, EXP3-OE, the first-of-its-kind with guaranteed efficient running time for SOPPP without requiring any auxiliary oracle. We provide an expected-regret bound of EXP3-OE in SOPPP matching the order of the best benchmark in the literature. Moreover, we introduce additional assumptions on the observability model under which we can further improve the regret bounds of EXP3-OE. We illustrate the benefit of using EXP3-OE in SOPPP by applying it to the online CB and HS games.
机译:资源分配游戏,如着名的上校Blotto(CB)和隐藏和寻求(HS)游戏通常用于模拟各种实际问题,但只有在他们的一拍版本中。事实上,由于他们非常大的策略空间,它仍然是一个开放的问题,如何在这些游戏中有效地学习。在这项工作中,我们表明,在线CB和HS游戏可以作为侧视图的路径规划问题(SOPPP):在每个阶段,学习者在指向的非循环图上选择路径,并遭受损失的总和对相应的边缘进行对接的;然后,她接收与侧视图的半匪盗反馈(即,她观察所选择的边缘加上其他一些的损失)。我们提出了一种新颖的算法,EXP3-OE,首先具有SOPPP的有效运行时间,而无需任何辅助甲骨文。我们在SOPPP中提供了EXP3-OE的预期遗憾,符合文献中最佳基准的顺序。此外,我们对观察性模型的额外假设介绍了我们可以进一步改善Exp3-OE的遗憾范围。我们通过将Soppp应用于在线CB和HS游戏,说明使用Exp3-OE的益处。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号