...
首页> 外文期刊>ACM Transactions on Interactive Intelligent Systems >A Bandit-Based Ensemble Framework for Exploration/ Exploitation of Diverse Recommendation Components: An Experimental Study within E-Commerce
【24h】

A Bandit-Based Ensemble Framework for Exploration/ Exploitation of Diverse Recommendation Components: An Experimental Study within E-Commerce

机译:基于Bandit的集合框架,用于探索/开发各种推荐组件:电子商务中的一项实验研究

获取原文
获取原文并翻译 | 示例
           

摘要

This work presents an extension of Thompson Sampling bandit policy for orchestrating the collection of base recommendation algorithms for e-commerce. We focus on the problem of item-to-item recommendations, for which multiple behavioral and attribute-based predictors are provided to an ensemble learner. In addition, we detail the construction of a personalized predictor based on κ-Nearest Neighbors (κNN), with temporal decay capabilities and event weighting. We show how to adapt Thompson Sampling to realistic situations when neither action availability nor reward stationarity is guaranteed. Furthermore, we investigate the effects of priming the sampler with pre-set parameters of reward probability distributions by utilizing the product catalog and/or event history, when such information is available. We report our experimental results based on the analysis of three real-world e-commerce datasets.
机译:这项工作提出了汤普森抽样匪盗策略的扩展,用于协调电子商务的基本推荐算法的收集。我们关注项目对项目建议的问题,为此向集成学习者提供了多种基于行为和基于属性的预测变量。此外,我们详细介绍了基于κ最近邻居(κNN),具有时间衰减功能和事件加权的个性化预测变量的构造。我们将展示在无法保证动作可用性和奖励平稳性的情况下,如何使汤普森采样适应现实情况。此外,当此类信息可用时,我们通过利用产品目录和/或事件历史记录来研究使用奖励概率分布的预设参数启动采样器的效果。我们基于对三个现实世界电子商务数据集的分析报告了我们的实验结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号