首页> 外文期刊>Knowledge and Information Systems >Detecting outlying subspaces for high-dimensional data: the new task, algorithms, and performance
【24h】

Detecting outlying subspaces for high-dimensional data: the new task, algorithms, and performance

机译:检测高维数据的外围子空间:新任务,算法和性能

获取原文
获取原文并翻译 | 示例
           

摘要

In this paper, we identify a new task for studying the outlying degree (OD) of high-dimensional data, i.e. finding the subspaces (subsets of features) in which the given points are outliers, which are called their outlying subspaces. Since the state-of-the-art outlier detection techniques fail to handle this new problem, we propose a novel detection algorithm, called High-Dimension Outlying subspace Detection (HighDOD), to detect the outlying subspaces of high-dimensional data efficiently. The intuitive idea of HighDOD is that we measure the OD of the point using the sum of distances between this point and itsknearest neighbors. Two heuristic pruning strategies are proposed to realize fast pruning in the subspace search and an efficient dynamic subspace search method with a sample-based learning process has been implemented. Experimental results show that HighDOD is efficient and outperforms other searching alternatives such as the naive top–down, bottom–up and random search methods, and the existing outlier detection methods cannot fulfill this new task effectively.
机译:在本文中,我们确定了一项研究高维数据的离边度(OD)的新任务,即查找给定点为离群点的子空间(要素的子集),称为离群点子空间。由于最新的离群值检测技术无法解决这个新问题,因此我们提出了一种新颖的检测算法,称为高维离析子空间检测(HighDOD),可以有效地检测高维数据的离析子空间。 HighDOD的直观想法是,我们使用该点与其近邻之间的距离之和来测量该点的OD。提出了两种启发式修剪策略以实现子空间搜索中的快速修剪,并实现了一种基于样本学习过程的高效动态子空间搜索方法。实验结果表明,HighDOD是有效的并且优于其他搜索选择,例如朴素的自上而下,自下而上和随机搜索方法,并且现有的异常值检测方法无法有效地完成这项新任务。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号