首页> 外文会议>IEEE/CIC International Conference on Communications in China >Online MEM Based Binary Classification Algorithm for China Mobile Imbalanced Dataset
【24h】

Online MEM Based Binary Classification Algorithm for China Mobile Imbalanced Dataset

机译:基于在线MEM的中国移动不平衡数据集二进制分类算法

获取原文

摘要

Driven by a plethora of real machine learning applications, there have been many attempts at improving the performance of a classifier applied to imbalanced dataset. In this paper we propose a maximum entropy machine (MEM) based hybrid algorithm to handle binary classification problems with high imbalance ratios and large numbers of features in the datasets. At the training stage, we combine an efficient MEM algorithm with the SMOTE algorithm to build a classifier in a batch manner. At the application stage, the different-cost strategy is incorporated into the MEM algorithm to handle the imbalance learning problem in an online manner. Experiments are conducted based on various real datasets (including one China Mobile dataset and several other standard test datasets) with different imbalance ratios and different numbers of features. The results show that the proposed algorithm outperforms the state-of-the-art algorithms significantly in terms of robustness and overall classification performance.
机译:在大量真实的机器学习应用程序的推动下,人们进行了许多尝试来提高应用于不平衡数据集的分类器的性能。在本文中,我们提出了一种基于最大熵机器(MEM)的混合算法来处理具有高不平衡率和数据集中大量特征的二进制分类问题。在训练阶段,我们将高效的MEM算法与SMOTE算法相结合,以分批方式构建分类器。在应用阶段,将差异成本策略合并到MEM算法中,以在线方式处理不平衡学习问题。实验是基于不平衡率和特征数量不同的各种真实数据集(包括一个中国移动数据集和其他几个标准测试数据集)进行的。结果表明,在鲁棒性和整体分类性能方面,该算法明显优于最新算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号