首页> 外文会议>Neural Networks (IJCNN), The 2012 International Joint Conference on >Applying adaptive over-sampling technique based on data density and cost-sensitive SVM to imbalanced learning
【24h】

Applying adaptive over-sampling technique based on data density and cost-sensitive SVM to imbalanced learning

机译:基于数据密度和成本敏感型支持向量机的自适应过采样技术在不平衡学习中的应用

获取原文

摘要

Resampling method is a popular and effective technique to imbalanced learning. However, most resampling methods ignore data density information and may lead to overfitting. A novel adaptive over-sampling technique based on data density (ASMOBD) is proposed in this paper. Compared with existing resampling algorithms, ASMOBD can adaptively synthesize different number of new samples around each minority sample according to its level of learning difficulty. Therefore, this method makes the decision region more specific and can eliminate noise. What's more, to avoid over generalization, two smoothing methods are proposed. Cost- Sensitive learning is also an effective technique to imbalanced learning. In this paper, ASMOBD and Cost-Sensitive SVM are combined. Experiments show that our methods perform better than various state-of-art approaches on 9 UCI datasets by using metrics of G-mean and area under the receiver operation curve (AUC).
机译:重采样方法是一种不平衡学习的流行且有效的技术。但是,大多数重采样方法会忽略数据密度信息,并可能导致过度拟合。提出了一种新的基于数据密度的自适应过采样技术(ASMOBD)。与现有的重采样算法相比,ASMOBD可以根据其学习难度水平,在每个少数样本周围自适应地合成不同数量的新样本。因此,该方法使得决策区域更具体并且可以消除噪声。此外,为避免过度概括,提出了两种平滑方法。成本敏感型学习也是一种不平衡学习的有效技术。本文将ASMOBD和成本敏感型SVM结合起来。实验表明,通过使用G均值和接收器操作曲线(AUC)下的面积的度量,我们的方法在9个UCI数据集上的性能优于各种最新方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号