首页> 外国专利> METHOD FOR SEMI SUPERVISED REINFORCEMENT LEARNING USING DATA WITH LABEL AND DATA WITHOUT LABEL TOGETHER AND APPARATUS USING THE SAME

METHOD FOR SEMI SUPERVISED REINFORCEMENT LEARNING USING DATA WITH LABEL AND DATA WITHOUT LABEL TOGETHER AND APPARATUS USING THE SAME

机译：使用带标签的数据和不带标签的数据进行半监督加固学习的方法和使用该方法的设备

页面导航

摘要
著录项
相似文献

摘要

The present invention relates to a method for conducting semisupervised reinforcement learning jointly using labeled data and unlabeled data, and a device using the same. To be more specifically, the method allows a computing device to train a baseline neural network as an immediate compensation indicator by using labeled data when acquiring the labeled data and unlabeled data, and to train a policy neural network for searching a subset of the unlabeled data, wherein the subset is searched so that the validation accuracy of the immediate compensation indicator can increase during additional training of the immediate compensation indicator. Then, the computing device additionally trains the immediate compensation indicator by using the label given to the subset through the subset and the policy neural network. Therefore, the semisupervised reinforcement learning method can improve accuracy and efficiency of diagnosis assistance by machine learning.

机译：本发明涉及一种使用标记数据和未标记数据共同进行半监督强化学习的方法，以及使用该方法的设备。更具体地，该方法允许计算设备在获取标记的数据和未标记的数据时通过使用标记的数据来训练基线神经网络作为立即补偿指示符，并训练策略神经网络以搜索未标记的数据的子集。，其中搜索该子集，以便在对即时补偿指标进行额外培训期间可以提高即时补偿指标的验证准确性。然后，计算设备还通过使用通过子集和策略神经网络给予子集的标签来训练立即补偿指示符。因此，半监督强化学习方法可以通过机器学习提高诊断辅助的准确性和效率。

著录项

公开/公告号KR20190117969A

专利类型
公开/公告日2019-10-17

原文格式PDF
申请/专利权人 VUNO INC.;
展开▼

申请/专利号KR20180040972
发明设计人 PARK SEJIN;
展开▼

申请日2018-04-09
分类号G16H50/20;G06N3/08;G16H50/50;
国家 KR
入库时间 2022-08-21 11:49:39

相似文献

专利
外文文献
中文文献