A Fast Partial Label Learning Algorithm Based on the Max-Loss Function (一种基于最大值损失函数的快速偏标记学习算法)

Zhou Yu (周瑜); He Jianjun (贺建军); Gu Hong (顾宏); Zhang Junxing (张俊星)

Abstract

In the age of big data, learning with weak supervision has become one of the hot research topics in machine learning. Partial label learning, which deals with problems where each training example is associated with a set of candidate labels of which only one is the ground truth, is an important weakly supervised learning framework proposed recently, and it applies to many real-world tasks. The max-loss function accurately captures the relationship between a partially labeled sample and its candidate labels. However, because it usually yields a nondifferentiable objective function that is difficult to optimize, it is rarely adopted in existing algorithms. Moreover, existing partial label learning algorithms can only handle small-scale data and can rarely be applied to big data. To address these two problems, this paper presents a fast partial label learning algorithm based on the max-loss function. The basic idea is to transform the nondifferentiable objective into a differentiable concave function by using the aggregate function to approximate the max(·) function in the max-loss function, and then to solve the resulting concave objective with a stochastic quasi-Newton method. Experimental results show that the proposed algorithm not only achieves higher accuracy but also requires less computing time than state-of-the-art algorithms based on average-loss functions. Moreover, it can handle problems with millions of samples within several minutes.
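The abstract does not state the formula of the aggregate function. As a minimal sketch, assuming the commonly used log-sum-exp form f_p(x) = (1/p) ln Σ exp(p·x_i), the code below shows how it smooths max(·): the value lies between max(x) and max(x) + ln(m)/p for m inputs, and its gradient is a softmax over the inputs. The names `aggregate_max`, `aggregate_weights`, and the `scores` values are hypothetical and are not taken from the paper.

```python
import math


def aggregate_max(values, p=50.0):
    """Smooth approximation of max(values): (1/p) * log(sum(exp(p * v))).

    Assumed form of the aggregate function; larger p gives a tighter fit.
    """
    m = max(values)  # shift by the maximum so exp() cannot overflow
    return m + math.log(sum(math.exp(p * (v - m)) for v in values)) / p


def aggregate_weights(values, p=50.0):
    """Gradient of aggregate_max with respect to each value (a softmax)."""
    m = max(values)
    exps = [math.exp(p * (v - m)) for v in values]
    total = sum(exps)
    return [e / total for e in exps]


# Hypothetical model scores for the three candidate labels of one sample.
scores = [0.2, 1.5, 1.1]
for p in (1.0, 10.0, 100.0):
    approx = aggregate_max(scores, p)
    # The gap to the true maximum shrinks as p grows.
    print(p, approx, approx - max(scores))
```

Because the smoothed function is differentiable everywhere, gradient-based solvers such as the stochastic quasi-Newton method named in the abstract can be applied; the gradient weights concentrate on the highest-scoring candidate label as p increases.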

Bibliographic details

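Source
《计算机研究与发展》 (Journal of Computer Research and Development) | 2016, No. 5 | pp. 1053-1062 | 10 pages
Authors
Zhou Yu (周瑜); He Jianjun (贺建军); Gu Hong (顾宏); Zhang Junxing (张俊星)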
Affiliations

Faculty of Electronic Information and Electrical Engineering, Dalian University of Technology, Dalian, Liaoning 116024;

College of Information and Communication Engineering, Dalian Minzu University, Dalian, Liaoning 116600;

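Original format: PDF
Text language: Chinese
CLC classification: Information processing (information handling)
Keywords
partial label learning; max-loss function; aggregate function; weakly supervised learning; classification accuracy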
