IEEE International Conference on Acoustics, Speech and Signal Processing

Corrupted Contextual Bandits: Online Learning with Corrupted Context



Abstract

We consider a novel variant of the contextual bandit problem (i.e., the multi-armed bandit with side information, or context, available to a decision-maker) where the context used at each decision may be corrupted ("useless context"). This new problem is motivated by certain online settings, including clinical trial and ad recommendation applications. In order to address the corrupted-context setting, we propose to combine the standard contextual bandit approach with a classical multi-armed bandit mechanism. Unlike standard contextual bandit methods, we are able to learn from all iterations, even those with corrupted context, by improving the computation of the expectation for each arm. Promising empirical results are obtained on several real-life datasets.
