【24h】

Active Bagging Ensemble Selection

机译:主动套袋组合选择

获取原文

摘要

As technology progresses with more and more data collected, the need of finding the appropriate label for them increases. However, many times the labeling process is a very difficult or/and expensive task and in most cases a help of an expert or expensive equipment is needed. For this reason the need of labeling only the most appropriate instances rises. Active Learning techniques can accomplish this by querying only those instances that a trained model finds the greatest amount of information and providing them to a human expert in order to label them. Combining these techniques with a fast ensemble classifier, a very performant in terms of classification accuracy schema can emerge where a trained model in a small amount of labeled instances can grow by adding only the most informative instances from a much greater pool of unlabeled instances. In this paper, we will propose such a schema using Bagging Ensemble Selection that uses REPTree as base classifier under Active Learning techniques and we will compare it to four well-known ensemble classifiers under the same techniques on 61 real world datasets.
机译:随着技术的进步,收集的数据越来越多,为它们找到合适标签的需求也在增加。然而,很多时候,标签过程是一项非常困难或/和昂贵的任务,在大多数情况下,需要专家或昂贵设备的帮助。由于这个原因,只需要标记最合适的实例的需求增加了。主动学习技术可以通过只查询那些经过训练的模型发现的信息量最大的实例,并将它们提供给人类专家来标记它们来实现这一点。将这些技术与快速集成分类器相结合,就分类精度而言,可以出现一个非常高效的模式,其中少量标记实例中的训练模型可以通过从大量未标记实例中只添加信息量最大的实例来增长。在本文中,我们将提出一种使用Bagging集成选择的模式,该模式在主动学习技术下使用REPTree作为基础分类器,并将其与61个真实数据集上相同技术下的四个著名集成分类器进行比较。

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号