首页> 外文会议>IASTED International Conference on Artificial Intelligence and Applications >INCREMENTAL METHODS FOR DETECTING OUTLIERS FROM MULTIVARIATE DATA STREAM
【24h】

INCREMENTAL METHODS FOR DETECTING OUTLIERS FROM MULTIVARIATE DATA STREAM

机译:从多变量数据流检测异常值的增量方法

获取原文

摘要

Outlier detection is one of the most important data mining techniques. It has broad applications like fraud detection, credit approval, computer network intrusion detection, anti-money laundering, etc. The basis of outlier detection is to identify data points which are "different" or "far away" from the rest of the data points in the given dataset. Traditional outlier detection method is based on statistical analysis. However, this traditional method has an inherent drawback - it requires the availability of the entire dataset. In practice, especially in the real time data feed application, it is not so realistic to wait for all the data because fresh data are streaming in very quickly. Outlier detection is hence done in batches. However two drawbacks may arise: relatively long processing time because of the massive size, and the result may be outdated soon between successive updates. In this paper, we propose several novel incremental methods to process the real time data effectively for outlier detection. For the experiment, we test three types of mechanisms for analyzing the dataset, namely Global Analysis, Cumulative Analysis and Lightweight Analysis with Sliding Window. The experiment dataset is "household power consumption" which is a popular benchmarking data for Massive Online Analysis.
机译:异常值检测是最重要的数据挖掘技术之一。它具有广泛的应用程序,如欺诈检测,信用批准,计算机网络入侵检测,反洗钱等。远离探测的基础是从数据点的其余部分识别“不同”或“远离”的数据点在给定的数据集中。传统的异常值检测方法基于统计分析。但是,这种传统方法具有固有的缺点 - 它需要整个数据集的可用性。在实践中,特别是在实时数据馈送应用中,等待所有数据并不是如此现实,因为新数据在很快流动。因此,批量生产的异常检测然而,可能出现两个缺点:由于大小的大小而相对长的处理时间,并且结果可以在连续更新之间很快过时。在本文中,我们提出了几种新颖的增量方法来有效地处理实时数据,以便进行异常检测。对于实验,我们测试三种类型的机制,用于分析数据集,即全局分析,累积分析和轻量级分析,通过滑动窗口。实验数据集是“家庭电力消耗”,这是一种流行的基准测试数据,用于大规模在线分析。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号