Stochastic iterative dynamic programming: a Monte Carlo approach to dual control

Thompson AM; Cluett WR

首页> 外文期刊>Automatica >Stochastic iterative dynamic programming: a Monte Carlo approach to dual control

【24h】

Stochastic iterative dynamic programming: a Monte Carlo approach to dual control

机译：随机迭代动态规划：双重控制的蒙特卡洛方法

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Practical exploitation of optimal dual control (ODC) theory continues to be hindered by the difficulties involved in numerically solving the associated stochastic dynamic programming (SDPs) problems. In particular, high-dimensional hyper-states coupled with the nesting of optimizations and integrations within these SDP problems render their exact numerical solution computationally prohibitive. This paper presents a new stochastic dynamic programming algorithm that uses a Monte Carlo approach to circumvent the need for numerical integration, thereby dramatically reducing computational requirements. Also, being a generalization of iterative dynamic programming (IDP) to the stochastic domain, the new algorithm exhibits reduced sensitivity to the hyper-state dimension and, consequently, is particularly well suited to solution of ODC problems. A convergence analysis of the new algorithm is provided, and its benefits are illustrated on the problem of ODC of an integrator with unknown gain, originally presented by angstrom strom and Helmersson (Computers and Mathematics with Applications 12A (1986) 653-662). (c) 2005 Elsevier Ltd. All rights reserved.

机译：最优双重控制（ODC）理论的实际开发继续受到数字解决相关随机动态规划（SDP）问题所涉及的困难的阻碍。特别是，高维超状态加上这些SDP问题中的优化和集成嵌套，使得它们的精确数值解在计算上令人望而却步。本文提出了一种新的随机动态规划算法，该算法使用蒙特卡洛方法来规避数值积分的需求，从而显着降低了计算需求。同样，作为迭代动态规划（IDP）到随机域的推广，新算法对超状态维的敏感度降低，因此特别适合解决ODC问题。提供了新算法的收敛性分析，并说明了由未知增益的积分器的ODC问题所带来的好处，该问题最初由Angstrom strom和Helmersson提出（《计算机和数学与应用》 12A（1986）653-662）。（c）2005 Elsevier Ltd.保留所有权利。

著录项

来源
《Automatica》 |2005年第5期|共12页
作者
Thompson AM; Cluett WR;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类自动化基础理论;
关键词
dual control; adaptive control; stochastic systems; dynamic programming; optimal control; uncertainty; APPROXIMATIONS; OPTIMIZATION; SELECTION; RANKING;

机译：双控制;自适应控制;随机系统;动态规划;最优控制;不确定性;逼近;优化;选择;排名;

相似文献

外文文献
中文文献
专利

1. Stochastic iterative dynamic programming: a Monte Carlo approach to dual control [J] . Thompson AM, Cluett WR Automatica . 2005,第5期

机译：随机迭代动态规划：双重控制的蒙特卡洛方法
2. Controlling Procedural Modeling Programs with Stochastically-Ordered Sequential Monte Carlo [J] . Daniel Ritchie, Ben Mildenhall, Noah D. Goodman, ACM Transactions on Graphics . 2015,第4CD期

机译：用随机顺序的顺序蒙特卡洛控制程序建模程序
3. Monte Carlo methods via a dual approach for some discrete time stochastic control problems [J] . Gyurko Lajos Gergely, Hambly Ben M., Witte Jan Hendrik Mathematical methods of operations research . 2015,第1期

机译：通过对偶方法的蒙特卡洛方法来解决一些离散时间随机控制问题
4. Computational Complexity of Stochastic Programming: Monte Carlo Sampling Approach [C] . Alexander Shapiro International Congress of Mathematicians . 2010

机译：随机编程的计算复杂性：蒙特卡罗采样方法
5. Portfolio optimization and dynamic hedging with receding horizon control, stochastic programming, and Monte Carlo simulation. [D] . Meindl, Peter James. 2007

机译：通过后退的水平控制，随机规划和蒙特卡洛模拟进行投资组合优化和动态套期保值。
6. Assessing the effect of child’s gender on their father–mother perception of the PedsQL™ 4.0 questionnaire: an iterative hybrid ordinal logistic regression/item response theory approach with Monte Carlo simulation [O] . Marziyeh Doostfatemeh, Seyyed Mohammad Taghi Ayatollahi, Peyman Jafari 2020

机译：评估儿童性别对父母母亲的母亲对PEDSQL™4.0调查问卷的影响：蒙特卡罗模拟的迭代混合序数回归/项目响应理论方法
7. Monte Carlo methods via a dual approach for some discrete time stochastic control problems [O] . Gyurko, LG, Hambly, B, Witte, JH 2015

机译：蒙特卡罗方法通过对偶方法进行离散时间随机控制问题

Stochastic iterative dynamic programming: a Monte Carlo approach to dual control

摘要

著录项

相似文献

相关主题

期刊订阅