Foreign Conference Proceedings > BICA Society Meeting > An Intrinsically Motivated Robot Explores Non-reward Environments with Output Arbitration

An Intrinsically Motivated Robot Explores Non-reward Environments with Output Arbitration



Abstract

In real-world settings, rewards are often sparse because the state space is huge. Reinforcement learning agents must acquire exploration skills to obtain rewards in such environments. In that case, curiosity, defined as an internally generated reward for state prediction error, can encourage agents to explore their environments. However, when a robot learns its policy by reinforcement learning, changes in the policy's outputs cause jerking because of inertia. Jerking prevents the state prediction from converging, which makes policy learning unstable. In this paper, we propose Arbitrable Intrinsically Motivated Exploration (AIME), which enables robots to stably learn curiosity-based exploration. AIME uses the Accumulator Based Arbitration Model (ABAM), which we previously proposed as an ensemble learning method inspired by the prefrontal cortex. ABAM adjusts motor controls to improve the stability of reward generation and reinforcement learning. In experiments, we show that a robot can explore a non-reward simulated environment with AIME.
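The curiosity signal described above (intrinsic reward equal to the forward model's state-prediction error) can be sketched as follows. This is an illustrative minimal sketch, not the authors' architecture: the linear forward model, the learning rate, and the toy dynamics are all assumptions.

```python
import numpy as np

class ForwardModel:
    """Predicts the next state from (state, action); the prediction
    error serves as the intrinsic (curiosity) reward."""
    def __init__(self, state_dim, action_dim, lr=0.05):
        rng = np.random.default_rng(0)
        self.W = rng.normal(scale=0.1, size=(state_dim, state_dim + action_dim))
        self.lr = lr

    def intrinsic_reward(self, state, action, next_state):
        x = np.concatenate([state, action])
        pred = self.W @ x
        err = next_state - pred
        # Online gradient step on the squared prediction error (LMS rule).
        self.W += self.lr * np.outer(err, x)
        return 0.5 * float(err @ err)  # curiosity = prediction error

def rollout(model, steps=200):
    """Toy deterministic linear dynamics (assumed for illustration):
    curiosity should shrink as the forward model learns them."""
    rng = np.random.default_rng(1)
    A = np.array([[0.9, 0.1], [-0.1, 0.9]])  # state transition (assumed)
    B = np.array([[0.5], [0.2]])             # action effect (assumed)
    s = np.zeros(2)
    rewards = []
    for _ in range(steps):
        a = rng.uniform(-1.0, 1.0, size=1)
        s_next = A @ s + B @ a
        rewards.append(model.intrinsic_reward(s, a, s_next))
        s = s_next
    return rewards

model = ForwardModel(state_dim=2, action_dim=1)
r = rollout(model)
```

On deterministic dynamics the intrinsic reward decays as the model improves; in states the model already predicts well, curiosity vanishes, which is what pushes the agent toward unexplored states. Note the abstract's point about jerking: if the policy's outputs change abruptly, `next_state` is corrupted by inertia effects the forward model cannot fit, so this error signal never converges.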
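The abstract does not specify ABAM's internals, so the following is a generic accumulator-race sketch of output arbitration, not the authors' ABAM: each candidate module accumulates evidence for its proposed motor command, and control switches only when a challenger's accumulated evidence clearly beats the incumbent's, suppressing the rapid output changes that cause jerking.

```python
import numpy as np

class AccumulatorArbiter:
    """Hypothetical accumulator-based arbiter (illustrative, not ABAM):
    leaky evidence accumulation with a switching threshold."""
    def __init__(self, n_modules, threshold=3.0, leak=0.9):
        self.acc = np.zeros(n_modules)
        self.threshold = threshold
        self.leak = leak
        self.active = 0  # module currently controlling the output

    def step(self, scores):
        """scores[i]: instantaneous preference for module i's command."""
        self.acc = self.leak * self.acc + scores  # leaky accumulation
        challenger = int(np.argmax(self.acc))
        # Switch only when the challenger clearly beats the incumbent,
        # so momentary score fluctuations cannot jerk the output.
        if challenger != self.active and (
            self.acc[challenger] - self.acc[self.active] > self.threshold
        ):
            self.active = challenger
            self.acc[:] = 0.0  # reset evidence after a switch
        return self.active

arb = AccumulatorArbiter(n_modules=2)
rng = np.random.default_rng(0)
# Noisy scores consistently favoring module 1: the arbiter holds
# module 0 until sustained evidence crosses the threshold, then
# switches exactly once and stays with module 1.
choices = [arb.step(np.array([0.0, 0.4]) + rng.normal(scale=0.1, size=2))
           for _ in range(50)]
```

The design choice this illustrates is hysteresis: a single switch replaces the step-by-step output flapping a plain argmax over instantaneous scores would produce, which is the stability property the abstract attributes to ABAM.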
