首页> 外文会议>Conference on Towards Autonomous Robotic Systems >A Vision-Guided Parallel Parking System for a Mobile Robot using Approximate Policy Iteration
【24h】

A Vision-Guided Parallel Parking System for a Mobile Robot using Approximate Policy Iteration

机译:使用近似政策迭代的移动机器人的视觉引导并行停车系统

获取原文

摘要

Reinforcement Learning (RL) methods enable autonomous robots to learn skills from scratch by interacting with the environment. However, reinforcement learning can be very time consuming. This paper focuses on accelerating the reinforcement learning process on a mobile robot in an unknown environment. The presented algorithm is based on approximate policy iteration with a continuous state space and a fixed number of actions. The action-value function is represented by a weighted combination of basis functions. Furthermore, a complexity analysis is provided to show that the implemented approach is guaranteed to converge on an optimal policy with less computational time. A parallel parking task is selected for testing purposes. In the experiments, the efficiency of the proposed approach is demonstrated and analyzed through a set of simulated and real robot experiments, with comparison drawn from two well known algorithms (Dyna-Q and Q-learning).
机译:强化学习(RL)方法使自主机器人通过与环境进行交互来学习从头开始学习技能。然而,加强学习可能非常耗时。本文侧重于在未知环境中加速在移动机器人上的加固学习过程。呈现的算法基于具有连续状态空间的近似政策迭代和固定数量的动作。动作值函数由基础函数的加权组合表示。此外,提供了复杂性分析以表明实现的方法是保证在具有较少计算时间的最佳策略上收敛。选择并行停车任务以进行测试目的。在实验中,通过一组模拟和真正的机器人实验来证明和分析所提出的方法的效率,从两个公知的算法(Dyna-Q和Q-Learning)中汲取的比较。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号