Home > Foreign Conference Papers > International Conference on Robotics and Automation > Continuous Value Iteration (CVI) Reinforcement Learning and Imaginary Experience Replay (IER) For Learning Multi-Goal, Continuous Action and State Space Controllers

Continuous Value Iteration (CVI) Reinforcement Learning and Imaginary Experience Replay (IER) For Learning Multi-Goal, Continuous Action and State Space Controllers

Abstract

This paper presents a novel model-free Reinforcement Learning algorithm for learning behavior in continuous action, state, and goal spaces. The algorithm approximates optimal value functions using non-parametric estimators. It efficiently learns to reach multiple arbitrary goals in both deterministic and non-deterministic environments. To improve generalization in the goal space, we propose a novel sample augmentation technique. With these methods, robots learn faster and obtain better controllers overall. We benchmark the proposed algorithms in simulation and on a real-world voltage-controlled robot that learns to maneuver in a non-observable Cartesian task space.
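The abstract does not specify how the IER sample augmentation works. As a hedged sketch only, the general goal-relabeling idea it alludes to (replaying stored transitions with "imaginary" goals drawn from states the agent actually visited later, in the spirit of hindsight-style replay) could look like the following; the function name, the tuple layout, and the parameter `k` are illustrative assumptions, not the paper's API:

```python
import random

def augment_with_imaginary_goals(episode, k=4, rng=random):
    """Hypothetical goal-relabeling augmentation (not the paper's exact IER).

    episode: list of (state, action, next_state, goal) tuples from one rollout.
    Returns the original transitions plus k relabeled copies of each,
    where the goal is replaced by a state actually reached later on.
    """
    augmented = []
    for t, (s, a, s_next, goal) in enumerate(episode):
        augmented.append((s, a, s_next, goal))  # keep the real transition
        # States visited from step t onward serve as imaginary goals.
        future_states = [step[2] for step in episode[t:]]
        for _ in range(k):
            imagined_goal = rng.choice(future_states)
            augmented.append((s, a, s_next, imagined_goal))
    return augmented

# Toy 1-D rollout with two transitions toward the (never reached) goal 2.0.
episode = [((0.0,), 0, (0.5,), (2.0,)),
           ((0.5,), 1, (1.0,), (2.0,))]
aug = augment_with_imaginary_goals(episode, k=2)
print(len(aug))  # → 6: 2 originals + 2 relabeled copies each
```

Because the relabeled goals were actually reached, the corresponding replayed transitions carry informative reward signal even when the original goal was never attained, which is one plausible way such augmentation improves generalization across the goal space.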
