首页> 外文会议>International Conference on Machine Learning >BL-WoLF: A Framework For Loss-Bounded Learnability In Zero-Sum Games
【24h】

BL-WoLF: A Framework For Loss-Bounded Learnability In Zero-Sum Games

机译:BL-WOLF:零和游戏中的损失学报框架

获取原文

摘要

We present BL-WoLF, a framework for leam-ability in repeated zero-sum games where the cost of learning is measured by the losses the learning agent accrues (rather than the number of rounds). The game is adversarially chosen from some family that the learner knows. The opponent knows the game and the learner's learning strategy. The learner tries to either not accrue losses, or to quickly learn about the game so as to avoid future losses (this is consistent with the Win or Learn Fast (WoLF) principle; BL stands for "bounded loss"). Our framework allows for both probabilistic and approximate learning. The resultant notion of BL-WoLF-leamability can be applied to any class of games, and allows us to measure the inherent disadvantage to a player that does not know which game in the class it is in. We present guaranteed BL-WoLF-learnability results for families of games with deterministic payoffs and families of games with stochastic payoffs. We demonstrate that these families are guaranteed approximately BL-WoLF-learnable with lower cost. We then demonstrate families of games (both stochastic and deterministic) that are not guaranteed BL-WoLF-learnable. We show that those families, nevertheless, are BL-WoLF-learnable. To prove these results, we use a key lemma which we derive.
机译:我们展示了BL-WOLF,在重复的零和游戏中,在重复的零和游戏中,学习成本是通过损失学习代理累积(而不是轮次的数量)来衡量学习成本的框架。来自学习者知道的一些家庭的比赛是对手的。对手知道游戏和学习者的学习策略。学习者试图不归档,或者快速了解游戏,以免避免未来的损失(这与胜利或快速(狼)原则一致; BL代表“有界损失”)。我们的框架允许概率和近似学习。 Bl-Wolf-Leability的所产生的概念可以应用于任何类别的游戏,并允许我们衡量不知道它所在的课程中的游戏的播放器的固有缺点。我们提出了保证的Bl-Wolf-Incollat​​ibess与随机收益的确定性收益和小型游戏家庭的游戏家庭的结果。我们证明这些家庭得到了较低的成本较低的狼狼。然后,我们展示了游戏的家庭(随机和确定性),不保证Bl-Wolf-可爱。我们展示这些家庭是Bl-Wolf-Insearbable。为了证明这些结果,我们使用我们派生的关键引理。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号