Online Learning with Switching Costs and Other Adaptive Adversaries

机译：以交换成本和其他适应对手在线学习

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We study the power of different types of adaptive (nonoblivious) adversaries in the setting of prediction with expert advice, under both full-information and bandit feedback. We measure the player's performance using a new notion of regret, also known as policy regret, which better captures the adversary's adaptiveness to the player's behavior. In a setting where losses are allowed to drift, we characterize -in a nearly complete manner- the power of adaptive adversaries with bounded memories and switching costs. In particular,we show that with switching costs, the attainable rate with bandit feedback is Θ-tilde(T~(2/3)). Interestingly, this rate is significantly worse than the Θ(T~(1/2)) rate attainable with switching costs in the full-information case. Via a novel reduction from experts to bandits, we also show that a bounded memory adversary can force Θ-tilde(T~(2/3)) regret even in the full information case, proving that switching costs are easier to control than bounded memory adversaries. Our lower bounds rely on a new stochastic adversary strategy that generates loss processes with strong dependencies.

机译：我们研究不同类型的专家建议预测的设置自适应（nonoblivious）对手的力量，同时支持完全的信息和土匪反馈下。我们使用的遗憾了新的概念，也被称为政策的遗憾，这更好地捕捉对手的适应能力，以玩家的行为衡量球员的表现。在损失被允许漂移的设置，我们描述-in一个几乎完整的方式载有界的记忆和转换成本适应对手的力量。尤其是，我们表明，转换成本，与匪反馈可达到的速度是Θ-波浪号（T〜（2/3））。有趣的是，这个速度比Θ（T〜（1/2））率可达到与切换在全信息的情况下成本显著恶化。通过专家土匪一种新颖的减少，我们还表明，有界内存对手可以强制Θ-波浪号（T〜（2/3））感到遗憾，即使在完全信息的情况下，证明了转换成本要比界内存更容易控制对手。我们的下界依赖于具有很强的依赖性产生损失过程一个新的随机对手的策略。

著录项

来源
《Annual conference on Neural Information Processing Systems》|2013年||共9页
会议地点
作者
Nicolo Cesa-Bianchi; Ofer Dekel; Ohad Shamir;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类信息处理（信息加工）;
关键词

相似文献

外文文献
中文文献
专利

1. Online learning with feedback graphs and switching costs [J] . Anshuka Rangi, Massimo Franceschetti JMLR: Workshop and Conference Proceedings . 2018,第1期

机译：带有反馈图和转换成本的在线学习
2. Constructing online switching barriers: examining the effects of switching costs and alternative attractiveness on e-store loyalty in online pure-play retailers [J] . Ghazali Ezlika, Bang Nguyen, Mutum Dilip S., Electronic Markets . 2016,第2期

机译：建立在线转换障碍：检查转换成本和替代吸引力对在线纯零售商的电子商店忠诚度的影响
3. Randomized distributed online algorithms against adaptive offline adversaries [J] . Boyar Joan, Ellen Faith, Larsen Kim S. Information Processing Letters . 2020,第Sepa期

机译：随机分布式在线算法反对适应性离线对手
4. Online Learning with Switching Costs and Other Adaptive Adversaries [C] . Nicolo Cesa-Bianchi, Ofer Dekel, Ohad Shamir Annual conference on Neural Information Processing Systems . 2013

机译：具有转换成本和其他适应性对手的在线学习
5. Competitive advantage in e-commerce firms: Profitability, customer retention and switching costs in online banking. [D] . Roust, Tamara. 2008

机译：电子商务公司的竞争优势：在线银行的盈利能力，客户保留率和转换成本。
6. The Costs of Online Learning: Examining Differences in Motivation and Academic Outcomes in Online and Face-to-Face Community College Developmental Mathematics Courses [O] . Michelle K. Francis, Stephanie V. Wormington, Chris Hulleman 1993

机译：在线学习的成本：检查在线和面对面社区大学发展数学课程的动机和学业成绩的差异
7. Online Learning with Graph-Structured Feedback against Adaptive Adversaries [O] . Zhili Feng, Po-Ling Loh 2018

机译：在线学习与图形结构反馈反对自适应对手

Online Learning with Switching Costs and Other Adaptive Adversaries

摘要

著录项

相似文献

相关主题

期刊订阅