Conference on Uncertainty in Artificial Intelligence

Reducing Exploration of Dying Arms in Mortal Bandits

Abstract

Mortal bandits have proven to be extremely useful for providing news article recommendations, for running automated online advertising campaigns, and for other applications where the set of available options changes over time. Previous work on this problem showed how to regulate exploration of arms that have recently appeared, but it does not adapt when arms are about to disappear. Since in most applications we can determine either exactly or approximately when arms will disappear, we can leverage this information to improve performance: we should not be exploring arms that are about to disappear. We provide adaptations of algorithms, regret bounds, and experiments for this study, showing a clear benefit from regulating greed (exploration/exploitation) for arms that will soon disappear. We illustrate numerical performance on the Yahoo! Front Page Today Module User Click Log Dataset.
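
The abstract does not spell out the adapted algorithms, so the following is only an illustrative sketch of the general idea, not the authors' method. It assumes a UCB1-style index and a hypothetical damping factor, `remaining_life / (remaining_life + horizon_scale)`, that shrinks the exploration bonus as an arm approaches its known death time. The names `mortal_ucb_index`, `select_arm`, and `horizon_scale` are invented for this example.

```python
import math


def mortal_ucb_index(mean_reward, pulls, t, remaining_life, horizon_scale=50.0):
    """UCB1-style index whose exploration bonus shrinks as the arm nears expiry.

    The damping rule is an assumption made for illustration only.
    """
    if pulls == 0:
        # An unplayed arm is tried first, but only if it is still alive.
        return float("inf") if remaining_life > 0 else float("-inf")
    bonus = math.sqrt(2.0 * math.log(max(t, 2)) / pulls)
    # Hypothetical damping in [0, 1): near 0 when the arm is about to disappear.
    damping = remaining_life / (remaining_life + horizon_scale)
    return mean_reward + damping * bonus


def select_arm(arms, t):
    """Pick the live arm with the largest damped index.

    arms maps a name to (mean_reward, pulls, death_time).
    Raises ValueError if no arm is alive at time t.
    """
    live = {name: stats for name, stats in arms.items() if stats[2] > t}
    return max(
        live,
        key=lambda name: mortal_ucb_index(
            live[name][0], live[name][1], t, live[name][2] - t
        ),
    )


if __name__ == "__main__":
    # Arm "b" is under-explored but dies two steps from now, so it is skipped.
    arms = {"a": (0.5, 10, 1000), "b": (0.45, 2, 102)}
    print(select_arm(arms, t=100))
```

With an undamped UCB1 bonus, the under-explored arm "b" would win this round even though almost no time remains to exploit what is learned about it; the damping term is one simple way to express "do not explore arms that are about to disappear".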
