...
首页> 外文期刊>Automatica >Stochastic iterative dynamic programming: a Monte Carlo approach to dual control
【24h】

Stochastic iterative dynamic programming: a Monte Carlo approach to dual control

机译:随机迭代动态规划:双重控制的蒙特卡洛方法

获取原文
获取原文并翻译 | 示例
           

摘要

Practical exploitation of optimal dual control (ODC) theory continues to be hindered by the difficulties involved in numerically solving the associated stochastic dynamic programming (SDPs) problems. In particular, high-dimensional hyper-states coupled with the nesting of optimizations and integrations within these SDP problems render their exact numerical solution computationally prohibitive. This paper presents a new stochastic dynamic programming algorithm that uses a Monte Carlo approach to circumvent the need for numerical integration, thereby dramatically reducing computational requirements. Also, being a generalization of iterative dynamic programming (IDP) to the stochastic domain, the new algorithm exhibits reduced sensitivity to the hyper-state dimension and, consequently, is particularly well suited to solution of ODC problems. A convergence analysis of the new algorithm is provided, and its benefits are illustrated on the problem of ODC of an integrator with unknown gain, originally presented by angstrom strom and Helmersson (Computers and Mathematics with Applications 12A (1986) 653-662). (c) 2005 Elsevier Ltd. All rights reserved.
机译:最优双重控制(ODC)理论的实际开发继续受到数字解决相关随机动态规划(SDP)问题所涉及的困难的阻碍。特别是,高维超状态加上这些SDP问题中的优化和集成嵌套,使得它们的精确数值解在计算上令人望而却步。本文提出了一种新的随机动态规划算法,该算法使用蒙特卡洛方法来规避数值积分的需求,从而显着降低了计算需求。同样,作为迭代动态规划(IDP)到随机域的推广,新算法对超状态维的敏感度降低,因此特别适合解决ODC问题。提供了新算法的收敛性分析,并说明了由未知增益的积分器的ODC问题所带来的好处,该问题最初由Angstrom strom和Helmersson提出(《计算机和数学与应用》 12A(1986)653-662)。 (c)2005 Elsevier Ltd.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号