首页> 外文期刊>JMLR: Workshop and Conference Proceedings >Making Classifier Chains Resilient to Class Imbalance
【24h】

Making Classifier Chains Resilient to Class Imbalance

机译:使分类器链适应类不平衡

获取原文
           

摘要

Class imbalance is an intrinsic characteristic of multi-label data. Most of the labels in multi-label data sets are associated with a small number of training examples, much smaller compared to the size of the data set. Class imbalance poses a key challenge that plagues most multi-label learning methods. Ensemble of Classifier Chains (ECC), one of the most prominent multi-label learning methods, is no exception to this rule, as each of the binary models it builds is trained from all positive and negative examples of a label. To make ECC resilient to class imbalance, we first couple it with random undersampling. We then present two extensions of this basic approach, where we build a varying number of binary models per label and construct chains of different sizes, in order to improve the exploitation of majority examples with approximately the same computational budget. Experimental results on 16 multi-label datasets demonstrate the effectiveness of the proposed approaches in a variety of evaluation metrics.
机译:类不平衡是多标签数据的固有特征。多标签数据集中的大多数标签都与少量训练示例相关联,与数据集的大小相比要小得多。班级失衡是困扰大多数多标签学习方法的关键挑战。分类器链集成(ECC)是最著名的多标签学习方法之一,该规则也不例外,因为它构建的每个二元模型都是根据标签的所有正例和负例进行训练的。为了使ECC能够抵抗类不平衡,我们首先将其与随机欠采样耦合。然后,我们介绍了这种基本方法的两个扩展,其中我们为每个标签构建了不同数量的二进制模型,并构建了不同大小的链,以便以近似相同的计算预算来改进大多数示例的利用。在16个多标签数据集上的实验结果证明了所提出方法在各种评估指标中的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号