首页> 外文会议>International Conference on Machine Learning >Semi-Supervised Classification Based on Classification from Positive and Unlabeled Data
【24h】

Semi-Supervised Classification Based on Classification from Positive and Unlabeled Data

机译:基于正和未标记数据分类的半监督分类

获取原文

摘要

Most of the semi-supervised classification methods developed so far use unlabeled data for regularization purposes under particular distributional assumptions such as the cluster assumption. In contrast, recently developed methods of classification from positive and unlabeled data (PU classification) use unlabeled data for risk evaluation, i.e., label information is directly extracted from unlabeled data. In this paper, we extend PU classification to also incorporate negative data and propose a novel semi-supervised classification approach. We establish generalization error bounds for our novel methods and show that the bounds decrease with respect to the number of unlabeled data without the distributional assumptions that are required in existing semi-supervised classification methods. Through experiments, we demonstrate the usefulness of the proposed methods.
机译:大多数半监督的分类方法开发了到目前为止,使用未标记的数据进行正则化的特定分布假设,例如群集假设。相比之下,最近开发了来自正和未标记数据的分类方法(PU分类)使用未标记的风险评估数据,即标签信息直接从未标记的数据中提取。在本文中,我们扩展了PU分类,还纳入负数据并提出了一种新的半监督分类方法。我们为我们的新方法建立了泛化误差界限,并表明,在没有现有半监督分类方法中所需的分布假设的情况下,界限相对于未标记数据的数量减少。通过实验,我们证明了所提出的方法的有用性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号