...
首页> 外文期刊>Applied Intelligence: The International Journal of Artificial Intelligence, Neural Networks, and Complex Problem-Solving Technologies >Skill based transfer learning with domain adaptation for continuous reinforcement learning domains
【24h】

Skill based transfer learning with domain adaptation for continuous reinforcement learning domains

机译:基于技能的转移学习与持续加强学习域的域改编

获取原文
获取原文并翻译 | 示例
           

摘要

Although reinforcement learning is known as an effective machine learning technique, it might perform poorly in complex problems, especially real-world problems, leading to a slow rate of convergence. This issue magnifies when facing continuous domains where the curse of dimensionality is inevitable, and generalization is mostly desired. Transfer learning is a successful technique to remedy such a problem which results in significant improvements in learning performance by providing generalization not only within a task but also across different but related or similar tasks. The critical issue in transfer learning is how to incorporate the knowledge acquired from learning in a different but related task in the past. Domain adaptation is an exciting paradigm that seeks to address this challenge. In this paper, we propose a novel skill based Transfer Learning with Domain Adaptation (TLDA) approach suitable for continuous RL problems. TLDA discovers and learns skills as high-level knowledge from source task and then uses domain adaptation technique to help agent discover state-action mapping as a relation between the source and target tasks. With such mapping, TLDA can adapt source skills and speed up learning on a new target task. The experimental results verify the achievement of an effective transfer learning method for continuous reinforcement learning problems.
机译:虽然强化学习被称为有效的机器学习技术,但它可能在复杂问题中表现不佳,尤其是真实的问题,导致收敛速度慢。当面对一系列诅咒是不可避免的,这个问题在面对的连续域时放大,并且主要需要泛化。转移学习是一种成功的技术来解决这种问题,这导致不仅在任务中的概括而且在不同但相关或类似的任务中提供泛化的显着改善。转让学习的关键问题是如何在过去纳入不同但相关任务中获取的知识。域适应是一种令人兴奋的范例,寻求解决这一挑战。在本文中,我们提出了一种新的基于技能的转移学习,具有适合于连续RL问题的域适应(TLDA)方法。 TLDA发现并将技能从源任务中发现和学习技能,然后使用域自适应技术来帮助代理发现状态操作映射作为源和目标任务之间的关系。通过这种映射,TLDA可以根据新的目标任务调整源技能和加速学习。实验结果验证了实现持续增强学习问题的有效转移学习方法的实现。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号