On using discretized Cohen-Grossberg node dynamics for model-free actor-critic neural learning in non-Markovian domains

Abstract

We describe how multi-stage non-Markovian decision problems can be solved with actor-critic reinforcement learning by assuming that a discrete version of Cohen-Grossberg node dynamics describes the node-activation computations of a neural network (NN). Our NN can render the process Markovian implicitly and automatically, in a totally model-free fashion, without learning by how much the state space must be augmented for the Markov property to hold. This serves as an alternative to using an Elman- or Jordan-type function as a history memory to develop sensitivity to non-Markovian dependencies. We demonstrate the concept on a small-scale non-Markovian deterministic path problem, in which our actor-critic NN finds an optimal sequence of actions, although it needs many iterations owing to the nature of neural model-free learning. This is, in spirit, a neuro-dynamic programming approach.
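
The abstract does not give the discretized update itself, so the following is only a minimal sketch of the general idea, not the authors' formulation. It assumes the additive special case of Cohen-Grossberg dynamics, dx_i/dt = -x_i + sum_j w_ij f(x_j) + I_i, discretized by a forward-Euler step of size h, with f = tanh. The names `cg_step` and `run`, the step size, and the toy inputs are all hypothetical. The point it illustrates is that for h < 1 each node keeps a decaying trace of its own past activation, so two histories ending in the same observation leave the network in different states without any explicit Elman- or Jordan-style context units.

```python
import math

def cg_step(x, inputs, weights, h=0.5):
    """One forward-Euler step of additive node dynamics (an assumed
    special case of Cohen-Grossberg dynamics, not the paper's exact form).

    x       : current node activations x_i(t)
    inputs  : external inputs I_i(t)
    weights : weights[i][j] is the connection from node j to node i
    h       : step size in (0, 1]; hypothetical default
    """
    squashed = [math.tanh(v) for v in x]
    new_x = []
    for i, x_i in enumerate(x):
        net = sum(w * s for w, s in zip(weights[i], squashed)) + inputs[i]
        # x_i(t+1) = (1 - h) * x_i(t) + h * net
        # For h < 1 the old activation leaks into the new one.
        new_x.append((1.0 - h) * x_i + h * net)
    return new_x

def run(sequence, weights, h=0.5):
    """Feed a sequence of input vectors through the dynamics from rest."""
    x = [0.0] * len(weights)
    for inputs in sequence:
        x = cg_step(x, inputs, weights, h)
    return x

# Two input histories that end in the same observation [0, 1].
W = [[0.0, 0.0], [0.0, 0.0]]  # zero weights isolate the leak term
a = run([[1.0, 0.0], [0.0, 1.0]], W)
b = run([[0.0, 0.0], [0.0, 1.0]], W)
print(a, b)  # the final states differ because of the retained trace
```

With h = 1 the leak term vanishes and, for these zero weights, both histories end in the same state, which is the memoryless case that a purely feedforward node computation would give.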

Bibliographic record

Source: 2003, pp. 1-6 (6 pages)
Authors: Mizutani, E.; Dreyfus, S.E.
Original format: PDF
Chinese Library Classification: Radio electronics, telecommunications technology
Keywords
neural nets; learning (artificial intelligence); Markov processes; decision making; discretized Cohen-Grossberg node dynamics; model-free actor-critic neural learning; non-Markovian domains; neural networks; small-scale non-Markovian deterministic path pro

Similar literature

1. Totally model-free actor-critic recurrent neural-network reinforcement learning in non-Markovian domains [J]. Mizutani Eiji, Dreyfus Stuart. Annals of Operations Research. 2017, No. 1

2. Estimation of the Domain of Attraction of Discrete-Time Impulsive Cohen-Grossberg Neural Networks Model With Impulse Input Saturation [J]. Shen Zixiang, Li Chuandong, Li Yi. Neural Processing Letters. 2021, No. 3

3. Dynamic behaviours for semi-discrete stochastic Cohen-Grossberg neural networks with time delays [J]. Zhang Tianwei, Han Sufang, Zhou Jianwen. Journal of the Franklin Institute. 2020, No. 17

4. On using discretized Cohen-Grossberg node dynamics for model-free actor-critic neural learning in non-Markovian domains [C]. Eiji Mizutani, Stuart E. Dreyfus. IEEE International Symposium on Computational Intelligence in Robotics and Automation. 2003

5. Dynamic tuning of PI-controllers based on model-free Reinforcement Learning methods [D]. Abbasi Brujeni, Lena. 2010

6. A novel approach to locomotion learning: Actor-Critic architecture using central pattern generators and dynamic motor primitives [O]. Cai Li, Robert Lowe, Tom Ziemke. 2014

7. Totally Model-Free Reinforcement Learning by Actor-Critic Elman Networks in Non-Markovian Domains [O]. Eiji Mizutani, Stuart E. Dreyfus. 1998
