首页>
外国专利>
METHOD FOR SEMI SUPERVISED REINFORCEMENT LEARNING USING DATA WITH LABEL AND DATA WITHOUT LABEL TOGETHER AND APPARATUS USING THE SAME
METHOD FOR SEMI SUPERVISED REINFORCEMENT LEARNING USING DATA WITH LABEL AND DATA WITHOUT LABEL TOGETHER AND APPARATUS USING THE SAME
展开▼
机译:使用带标签的数据和不带标签的数据进行半监督加固学习的方法和使用该方法的设备
展开▼
页面导航
摘要
著录项
相似文献
摘要
The present invention relates to a method for conducting semisupervised reinforcement learning jointly using labeled data and unlabeled data, and a device using the same. To be more specifically, the method allows a computing device to train a baseline neural network as an immediate compensation indicator by using labeled data when acquiring the labeled data and unlabeled data, and to train a policy neural network for searching a subset of the unlabeled data, wherein the subset is searched so that the validation accuracy of the immediate compensation indicator can increase during additional training of the immediate compensation indicator. Then, the computing device additionally trains the immediate compensation indicator by using the label given to the subset through the subset and the policy neural network. Therefore, the semisupervised reinforcement learning method can improve accuracy and efficiency of diagnosis assistance by machine learning.
展开▼