首页> 美国卫生研究院文献>Cognitive Neurodynamics >Route searching based on neural networks and heuristic reinforcement learning
【2h】

Route searching based on neural networks and heuristic reinforcement learning

机译:基于神经网络和启发式强化学习的路线搜索

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

In this paper, an improved and much stronger RNH-QL method based on RBF network and heuristic Q-learning was put forward for route searching in a larger state space. Firstly, it solves the problem of inefficiency of reinforcement learning if a given problem’s state space is increased and there is a lack of prior information on the environment. Secondly, RBF network as weight updating rule, reward shaping can give an additional feedback to the agent in some intermediate states, which will help to guide the agent towards the goal state in a more controlled fashion. Meanwhile, with the process of Q-learning, it is accessible to the underlying dynamic knowledge, instead of the need of background knowledge of an upper level RBF network. Thirdly, it improves the learning efficiency by incorporating the greedy exploitation strategy to train the neural network, which has been testified by the experimental results.
机译:提出了一种基于RBF网络和启发式Q学习的改进,更强大的RNH-QL方法,用于在较大状态空间中进行路径搜索。首先,如果增加了给定问题的状态空间并且缺少有关环境的先验信息,则可以解决强化学习效率低下的问题。其次,作为权重更新规则的RBF网络,奖励整形可以在某些中间状态下向代理提供额外的反馈,这将有助于以更可控的方式将代理引导至目标状态。同时,通过Q学习过程,底层动态知识可以访问它,而不需要上层RBF网络的背景知识。第三,结合贪婪开发策略训练神经网络,提高了学习效率,实验结果证明了这一点。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号