首页> 外文会议>AI 2010: Advances in artificial intelligence >An Effective Pattern Based Outlier Detection Approach for Mixed Attribute Data
【24h】

An Effective Pattern Based Outlier Detection Approach for Mixed Attribute Data

机译:一种基于有效模式的混合属性数据离群值检测方法

获取原文
获取原文并翻译 | 示例

摘要

Detecting outliers in mixed attribute datasets is one of major challenges in real world applications. Existing outlier detection methods lack effectiveness for mixed attribute datasets mainly due to their inability of considering interactions among different types of, e.g., numerical and categorical attributes. To address this issue in mixed attribute datasets, we propose a novel Pattern based Outlier Detection approach (POD). Pattern in this paper is defined to describe majority of data as well as capture interactions among different types of attributes. In POD, the more does an object deviate from these patterns, the higher is its outlier factor. We use logistic regression to learn patterns and then formulate the outlier factor in mixed attribute datasets. A series of experimental results illustrate that POD performs statistically significantly better than several classic outlier detection methods.
机译:在混合属性数据集中检测异常值是现实应用中的主要挑战之一。现有的离群值检测方法对于混合属性数据集缺乏有效性,这主要是因为它们无法考虑不同类型(例如,数值和分类属性)之间的相互作用。为了解决混合属性数据集中的这一问题,我们提出了一种新颖的基于模式的离群值检测方法(POD)。本文中定义的模式用于描述大多数数据以及捕获不同类型的属性之间的交互。在POD中,对象偏离这些模式的次数越多,其离群因素就越高。我们使用逻辑回归学习模式,然后在混合属性数据集中制定离群因子。一系列实验结果表明,POD的统计性能明显优于几种经典的异常值检测方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号