首页> 外文期刊>SIGKDD explorations >Detecting Anomalies in Dynamic Rating Data: A Robust Probabilistic Model for Rating Evolution
【24h】

Detecting Anomalies in Dynamic Rating Data: A Robust Probabilistic Model for Rating Evolution

机译:检测动态评级数据中的异常:评级演化的鲁棒概率模型

获取原文
获取原文并翻译 | 示例
           

摘要

Rating data is ubiquitous on websites such as Amazon, Trip- Advisor, or Yelp. Since ratings are not static but given at various points in time, a temporal analysis of rating data provides deeper insights into the evolution of a product's quality. In this work, we tackle the following question: Given the time stamped rating data for a product or service, how can we detect the general rating behavior of users as well as time intervals where the ratings behave anomalous? We propose a Bayesian model that represents the rating data as sequence of categorical mixture models. In contrast to existing methods, our method does not require any aggregation of the input but it operates on the original time stamped data. To capture the dynamic effects of the ratings, the categorical mixtures are temporally constrained: Anomalies can occur in specific time intervals only and the general rating behavior should evolve smoothly over time. Our method automatically determines the intervals where anomalies occur, and it captures the temporal effects of the general behavior by using a state space model on the natural parameters of the categorical distributions. For learning our model, we propose an efficient algorithm combining principles from variational inference and dynamic programming. In our experimental study we show the effectiveness of our method and we present interesting discoveries on multiple real world datasets.
机译:评分数据在Amazon,Trip-Advisor或Yelp等网站上无处不在。由于评级不是静态的而是在各个时间点给出的,因此对评级数据的时间分析可提供对产品质量演变的更深入的了解。在这项工作中,我们解决以下问题:给定产品或服务的带时间戳的评分数据,我们如何检测用户的总体评分行为以及评分行为异常的时间间隔?我们提出了一种贝叶斯模型,该模型将评级数据表示为类别混合模型的序列。与现有方法相比,我们的方法不需要输入的任何汇总,但是可以对原始时间戳数据进行操作。为了捕获评级的动态影响,类别混合在时间上受限制:异常只能在特定的时间间隔内发生,并且总体评级行为应随时间平稳发展。我们的方法自动确定异常发生的时间间隔,并通过使用状态空间模型对分类分布的自然参数捕获一般行为的时间影响。为了学习我们的模型,我们提出了一种有效的算法,该算法结合了变分推理和动态规划的原理。在我们的实验研究中,我们展示了我们方法的有效性,并在多个现实世界数据集上提出了有趣的发现。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号