...
首页> 外文期刊>電子情報通信学会技術研究報告. ニュ-ロコンピュ-ティング. Neurocomputing >Identification of partially observable environment based on on-line variational Bayes method and its application to reinforcement learning
【24h】

Identification of partially observable environment based on on-line variational Bayes method and its application to reinforcement learning

机译:基于在线变分贝叶斯方法的部分可观测环境识别及其在强化学习中的应用

获取原文
获取原文并翻译 | 示例
           

摘要

In this report, we present an on-line variational Bayes (VB) method for system identification based on linear state space models. The learning algorithm is implemented as the maximization of an on-line free energy, which can be used for determining the dimension of the internal state. We also propose a reinforcement learning (RL) method using this system identification method. Our RL method is applied to a simple automatic control problem with hidden state variables. The result shows that our method is able to determine correctly the dimension of the internal state and to acquire a good control, even in a partially observable environment.
机译:在本报告中,我们提出了一种基于线性状态空间模型的在线变分贝叶斯(VB)方法进行系统识别。该学习算法被实现为在线自由能的最大化,该自由能可用于确定内部状态的维数。我们还提出了使用这种系统识别方法的强化学习(RL)方法。我们的RL方法适用于带有隐藏状态变量的简单自动控制问题。结果表明,即使在部分可观察的环境中,我们的方法也能够正确确定内部状态的维数并获得良好的控制。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号