Conference: IEEE International Conference on Engineering Technologies and Applied Sciences

Modified Adaptive Synthetic SMOTE to Improve Classification Performance in Imbalanced Datasets

Abstract

Oversampling during data preprocessing is widely used to mitigate class imbalance in real-world research data. Imbalance can reduce a classification algorithm's ability to recognize the cases of interest, so that positive samples are misclassified as negative or false positives are generated. The Synthetic Minority Oversampling Technique (SMOTE) is one such oversampling method, and Adaptive Synthetic (Adasyn) SMOTE is one of its many variants. Adasyn incorporates K-Nearest Neighbor (KNN) computations; in this study, the Manhattan distance is applied in those computations. The modified Adasyn was evaluated on six imbalanced datasets in terms of overall accuracy, precision, recall, and F1 measure, using logistic regression as the classification algorithm. It outperformed SMOTE and the original Adasyn in 66.67 percent of the total performance-metric count (16 of 24 dataset-metric comparisons): it led in accuracy and in recall on 4 of the 6 datasets, in precision on 3 of 6, and in F1 measure on 5 of 6. These results indicate that the modified Adasyn can provide an efficient solution for reducing misclassification on imbalanced datasets.
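As a rough illustration of the idea in the abstract, the sketch below implements an Adasyn-style oversampler whose nearest-neighbor searches use the Manhattan distance. It is not the authors' code: the function names (`manhattan`, `k_nearest`, `adasyn_manhattan`), the parameters `k`, `beta`, and `seed`, and the choice to interpolate toward one of the `k` nearest minority points are all assumptions made for this example, and the logistic regression evaluation step is left out.

```python
import random


def manhattan(a, b):
    """Manhattan (L1) distance between two equal-length feature vectors."""
    return sum(abs(x - y) for x, y in zip(a, b))


def k_nearest(point, candidates, k, exclude_index=None):
    """Indices of the k candidates closest to `point` under Manhattan distance."""
    order = sorted(
        (i for i in range(len(candidates)) if i != exclude_index),
        key=lambda i: manhattan(point, candidates[i]),
    )
    return order[:k]


def adasyn_manhattan(X, y, minority_label=1, k=5, beta=1.0, seed=0):
    """Return synthetic minority samples (hypothetical helper, not the paper's code)."""
    rng = random.Random(seed)
    minority = [i for i, label in enumerate(y) if label == minority_label]
    n_majority = len(y) - len(minority)
    # Number of synthetic samples needed to reach the balance level `beta`.
    total = int((n_majority - len(minority)) * beta)
    if total <= 0 or len(minority) < 2:
        return []

    # Difficulty ratio: share of majority points among each minority
    # point's k nearest neighbors in the full dataset.
    ratios = []
    for i in minority:
        neighbours = k_nearest(X[i], X, k, exclude_index=i)
        n_other = sum(1 for j in neighbours if y[j] != minority_label)
        ratios.append(n_other / len(neighbours))

    ratio_sum = sum(ratios)
    if ratio_sum == 0:
        # No minority point has majority neighbors: spread samples evenly.
        weights = [1.0 / len(minority)] * len(minority)
    else:
        weights = [r / ratio_sum for r in ratios]

    # Harder minority points (higher weight) receive more synthetic samples.
    minority_points = [X[i] for i in minority]
    synthetic = []
    for pos, weight in enumerate(weights):
        count = round(weight * total)
        if count == 0:
            continue
        base = minority_points[pos]
        neighbours = k_nearest(base, minority_points, k, exclude_index=pos)
        for _ in range(count):
            other = minority_points[rng.choice(neighbours)]
            lam = rng.random()  # interpolation factor in [0, 1)
            synthetic.append([a + lam * (b - a) for a, b in zip(base, other)])
    return synthetic
```

Because each per-point count is rounded, the number of returned samples can differ slightly from the requested total. Swapping `manhattan` for a Euclidean distance function would give the conventional Adasyn neighbor search for comparison.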
