Recursive Adaptation of Stepsize Parameter for Non-stationary Environments

机译：非平稳环境中步长参数的递归自适应

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this article, we propose a method to adapt stepsize parameters used in reinforcement learning for non-stationary environments. When the environment is non-stationary, the learning agent must adapt learning parameters like stepsize to the changes of environment through continuous learning. We show several theorems on higher-order derivatives of exponential moving average, which is a base schema of major reinforcement learning methods, using stepsize parameters. We also derive a systematic mechanism to calculate these derivatives in a recursive manner. Based on it, we construct a precise and flexible adaptation method for the stepsize parameter in order to maximize a certain criterion. The proposed method is also validated by several experimental results.

机译：在本文中，我们提出了一种用于调整用于非平稳环境的强化学习中的逐步调整参数的方法。当环境不稳定时，学习代理必须通过连续学习使学习参数（如逐步调整）适应环境的变化。我们展示了关于指数移动平均的高阶导数的几个定理，该定理是使用步长参数的主要强化学习方法的基本架构。我们还推导了一种系统的机制，以递归的方式计算这些导数。在此基础上，我们为stepsize参数构造了一种精确而灵活的自适应方法，以最大化某个准则。几种实验结果也验证了该方法的有效性。

著录项

来源
《International conference on principles of practice in multi-agent systems;PRIMA 2009》|2009年|P.525-533|共9页
会议地点
作者
Itsuki Noda;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类计算技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. Recursive fuzzy instrumental variable based evolving neuro-fuzzy identification for non-stationary dynamic system in a noisy environment [J] . de Oliveira Serra Ginalber Luiz, Rocha Filho Orlando Donato Fuzzy sets and systems . 2018,第MAY1期

机译：噪声环境下非平稳动态系统的基于递归模糊工具变量的进化神经模糊辨识
2. Recursive estimation of model parameters with sharp discontinuity in non-stationary air quality data [J] . C.N. Ng, T.L. Yan Environmental Modelling & Software . 2004,第1期

机译：在非平稳空气质量数据中具有不连续性的模型参数的递归估计
3. Adaptation Method of the Exploration Ratio Based on the Orientation of Equilibrium in Multi-Agent Reinforcement Learning Under Non-Stationary Environments [J] . Takuya Okano, Itsuki Noda Journal of Advanced Computatioanl Intelligence and Intelligent Informatics . 2017,第5a125期

机译：基于非静止环境下多智能体增强学习中均衡方向的勘探率的适应方法
4. Recursive Adaptation of Stepsize Parameter for Non-stationary Environments [C] . Itsuki Noda Workshop on Adaptive and Learning Agents . 2010

机译：递归适应非静止环境的步骤化参数
5. Recursive Parameter Estimation using Polynomial Chaos Theory Applied to Vehicle Mass Estimation for Rough Terrain. [D] . Pence, Benjamin Lynn. 2011

机译：基于多项式混沌理论的递归参数估计在粗糙地形车辆质量估计中的应用。
6. Diffusion Logarithm-Correntropy Algorithm for Parameter Estimation in Non-Stationary Environments over Sensor Networks [O] . Limei Hu, Feng Chen, Shukai Duan, 2018

机译：传感器网络非平稳环境中参数估计的对数扩散对数算法
7. Recursive Adaptation of Stepsize Parameter for Non-Stationary Environments [O] . Itsuki Noda 2010

机译：非平稳环境中步长参数的递归自适应

Recursive Adaptation of Stepsize Parameter for Non-stationary Environments

摘要

著录项

相似文献

相关主题

期刊订阅