首页> 外文会议>International Conference on Advances in Natural Language Processing(EsTAL 2004); 20041020-22; Alicante(ES) >Adaptive Selection of Base Classifiers in One-Against-All Learning for Large Multi-labeled Collections
【24h】

Adaptive Selection of Base Classifiers in One-Against-All Learning for Large Multi-labeled Collections

机译:大型多标签集合的“一站式”学习中的基础分类器自适应选择

获取原文
获取原文并翻译 | 示例

摘要

In this paper we present the problem found when studying an automated text categorization system for a collection of High Energy Physics (HEP) papers, which shows a very large number of possible classes (over 1,000) with highly imbalanced distribution. The collection is introduced to the scientific community and its imbalance is studied applying a new indicator: the inner imbalance degree. The one-against-all approach is used to perform multi-label assignment using Support Vector Machines. Over-weighting of positive samples and S-Cut thresholding is compared to an approach to automatically select a classifier for each class from a set of candidates. We also found that it is possible to reduce computational cost of the classification task by discarding classes for which classifiers cannot be trained successfully.
机译:在本文中,我们提出了在研究针对高能物理(HEP)论文的自动文本分类系统时发现的问题,该系统显示出大量可能的类(超过1,000个)具有高度不平衡的分布。该馆藏介绍给科学界,并使用新的指标:内部不平衡度来研究其不平衡。反对一切的方法用于使用支持向量机执行多标签分配。将正样本的超重和S-Cut阈值化与自动从一组候选中为每个类别选择分类器的方法进行比较。我们还发现,有可能通过丢弃分类器无法成功训练的类来降低分类任务的计算成本。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号