首页> 外文会议> >On using discretized Cohen-Grossberg node dynamics for model-free actor-critic neural learning in non-Markovian domains
【24h】

On using discretized Cohen-Grossberg node dynamics for model-free actor-critic neural learning in non-Markovian domains

机译:关于使用离散Cohen-Grossberg节点动力学进行非马尔可夫域中的无模型演员批评神经学习

获取原文

摘要

We describe how multi-stage non-Markovian decision problems can be solved using actor-critic reinforcement learning by assuming that a discrete version of Cohen-Grossberg node dynamics describes the node-activation computations of neural network (NN). Our NN is capable of rendering the process Markovian implicitly and automatically in a totally model-free fashion without learning by how much the state apace must be augmented so that the Markov property holds. This serves as an alternative to using Elman or Jordan-type function as a history memory in order to develop sensitivity to non-Markovian dependencies. We shall demonstrate our concept using a small-scale non-Markovian deterministic path problem, in which our actor-critic NN finds an optimal sequence of actions, although it needs much iteration due to the nature of neural model-free learning. This is, in spirit, a neuro-dynamic programming approach.
机译:通过假设离散版本的Cohen-Grossberg节点动力学描述了神经网络(NN)的节点激活计算,我们描述了如何使用演员批评强化学习来解决多阶段非马尔科夫决策问题。我们的NN能够以完全无模型的方式隐式自动地渲染过程Markovian,而无需了解必须增加多少状态空间以保持Markov属性。这可以替代使用Elman或Jordan类型的函数作为历史记忆,从而提高对非Markovian依赖关系的敏感性。我们将使用小规模的非马尔可夫确定性路径问题来证明我们的概念,在该问题中,尽管由于无神经模型学习的本质,它需要大量迭代,但我们的行为准则神经网络仍可以找到最佳的动作序列。从本质上讲,这是一种神经动力学编程方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号