Let’s be Honest: An Optimal No-Regret Framework for Zero-Sum Games

Ehsan Asadi Kangarshahi; Ya-Ping Hsieh; Mehmet Fatih Sahin; Volkan Cevher

首页> 外文期刊>JMLR: Workshop and Conference Proceedings >Let’s be Honest: An Optimal No-Regret Framework for Zero-Sum Games

【24h】

Let’s be Honest: An Optimal No-Regret Framework for Zero-Sum Games

机译：坦白地说：零和游戏的最佳无遗憾框架

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

We revisit the problem of solving two-player zero-sum games in the decentralized setting. We propose a simple algorithmic framework that simultaneously achieves the best rates for honest regret as well as adversarial regret, and in addition resolves the open problem of removing the logarithmic terms in convergence to the value of the game. We achieve this goal in three steps. First, we provide a novel analysis of the optimistic mirror descent (OMD), showing that it can be modified to guarantee fast convergence for both honest regret and value of the game, when the players are playing collaboratively. Second, we propose a new algorithm, dubbed as robust optimistic mirror descent (ROMD), which attains optimal adversarial regret without knowing the time horizon beforehand. Finally, we propose a simple signaling scheme, which enables us to bridge OMD and ROMD to achieve the best of both worlds. Numerical examples are presented to support our theoretical claims and show that our non-adaptive ROMD algorithm can be competitive to OMD with adaptive step-size selection.

机译：我们重新讨论在分散的环境中解决两人零和游戏的问题。我们提出了一个简单的算法框架，该框架可同时实现诚实后悔和对抗后悔的最佳比率，此外还解决了消除对数项以达到游戏价值的开放性问题。我们分三步实现这一目标。首先，我们对乐观镜像后裔（OMD）进行了新颖的分析，显示了可以进行修改，以确保在玩家进行协作游戏时，既可以为诚实的遗憾又可以为游戏的价值提供快速的融合。其次，我们提出了一种称为鲁棒乐观镜像下降（ROMD）的新算法，该算法无需事先了解时间范围即可获得最佳的对抗遗憾。最后，我们提出了一种简单的信令方案，该方案使我们能够桥接OMD和ROMD以实现两全其美。数值例子证明了我们的理论主张，并表明我们的非自适应ROMD算法在自适应步长选择下可以与OMD竞争。

著录项

来源
《JMLR: Workshop and Conference Proceedings》 |2018年第2010期|共9页
作者
Ehsan Asadi Kangarshahi; Ya-Ping Hsieh; Mehmet Fatih Sahin; Volkan Cevher;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类人工智能理论;
关键词

相似文献

外文文献
中文文献
专利

1. Let’s be Honest: An Optimal No-Regret Framework for Zero-Sum Games [J] . Ehsan Asadi Kangarshahi, Ya-Ping Hsieh, Mehmet Fatih Sahin, JMLR: Workshop and Conference Proceedings . 2018,第2010期

机译：坦白地说：零和游戏的最佳无遗憾框架
2. Near-optimal no-regret algorithms for zero-sum games [J] . Daskalakis Constantinos, Deckelbaum Alan, Kim Anthony Games and economic behavior . 2015,第Null期

机译：零和游戏的近最佳无悔算法
3. Honest Signaling in Zero-Sum Games Is Hard, and Lying Is Even Harder [J] . Aviad Rubinstein LIPIcs : Leibniz International Proceedings in Informatics . 2017,第1期

机译：零和游戏中的诚实信号很难，说谎更难
4. Near-Optimal No-Regret Algorithms for Zero-Sum Games [C] . Constantinos Daskalakis, Alan Deckelbaum, Anthony Kim Annual ACM-SIAM Symposium on Discrete Algorithms . 2011

机译：零和游戏的近乎最佳无遗憾算法
5. Zero-sum Games & Zero-sum Frames: Employee Cognitive Consequences of Financial Firm Performance [D] . Brown, Daniel Albert. 2019

机译：零和游戏和零和框架：金融公司性能的员工认知后果
6. The politics of zero-sum thinking: The relationship between political ideology and the belief that life is a zero-sum game [O] . Shai Davidai, Martino Ongis 2019

机译：零和思想的政治：政治意识形态与生活是零和游戏的信念之间的关系
7. Near-Optimal No-Regret Algorithms for Zero-Sum Games [O] . Constantinos Daskalakis, Alan Deckelbaum, Anthony Kim 2014

机译：零和博弈的近似无遗憾算法

Let’s be Honest: An Optimal No-Regret Framework for Zero-Sum Games

摘要

著录项

相似文献

相关主题

期刊订阅