...
首页> 外文期刊>Journal of applied statistics >Outlier detection with Mahalanobis square distance: incorporating small sample correction factor
【24h】

Outlier detection with Mahalanobis square distance: incorporating small sample correction factor

机译:Mahalanobis平方距离的离群值检测:合并小的样本校正因子

获取原文
获取原文并翻译 | 示例
           

摘要

Mahalanobis square distances (MSDs) based on robust estimators improves outlier detection performance in multivariate data. However, the unbiasedness of robust estimators are not guaranteed when the sample size is small and this reduces their performance in outlier detection. In this study, we propose a framework that uses MSDs with incorporated small sample correction factor (c) and show its impact on performance when the sample size is small. This is achieved by using two prototypes, minimum covariance determinant estimator and S-estimators with bi-weight and t-biweight functions. The results from simulations show that distribution of MSDs for non-extreme observations are more likely to fit to chi-square with p degrees of freedom and MSDs of the extreme observations fit to F distribution, when c is incorporated into the model. However, without c, the distributions deviate significantly from chi-square and F observed for the case with incorporated c. These results are even more prominent for S-estimators. We present seven distinct comparison methods with robust estimators and various cut-off values and test their outlier detection performance with simulated data. We also present an application of some of these methods to the real data.
机译:基于鲁棒估计器的马氏距离(MSD)可改善多变量数据中的异常值检测性能。但是,当样本量较小时,不能保证鲁棒估计量的无偏性,这会降低其在异常值检测中的性能。在这项研究中,我们提出了一个框架,该框架使用结合了小的样本校正因子(c)的MSD,并显示了样本量较小时其对性能的影响。这是通过使用两个原型实现的,它们是具有双权函数和t-双权函数的最小协方差行列式估计器和S估计器。模拟结果表明,将c纳入模型后,用于非极端观测的MSD分布更适合于具有p自由度的卡方,而极端观测的MSD则适合于F分布。但是,没有c时,对于合并c的情况,分布与卡方和F明显不同。这些结果对于S估计量甚至更为突出。我们提出了七个具有鲁棒估计量和各种截断值的不同比较方法,并使用模拟数据测试了它们的异常检测性能。我们还介绍了其中一些方法对实际数据的应用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号