Implicit dual control based on particle filtering and forward dynamic programming

David S. Bayard; Alan Schumitzky

首页> 外文期刊>International Journal of Adaptive Control and Signal Processing >Implicit dual control based on particle filtering and forward dynamic programming

【24h】

Implicit dual control based on particle filtering and forward dynamic programming

机译：基于粒子滤波和前向动态规划的隐式双重控制

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper develops a sampling-based approach to implicit dual control. Implicit dual control methods synthesize stochastic control policies by systematically approximating the stochastic dynamic programming equations of Bellman, in contrast to explicit dual control methods that artificially induce probing into the control law by modifying the cost function to include a term that rewards learning. The proposed implicit dual control approach is novel in that it combines a particle filter with a policy-iteration method for forward dynamic programming. The integration of the two methods provides a complete sampling-based approach to the problem. Implementation of the approach is simplified by making use of a specific architecture denoted as a H-block. Practical suggestions are given for reducing computational loads within the H-block for real-time applications. As an example, the method is applied to the control of a stochastic pendulum model having unknown mass, length, initial position and velocity, and unknown sign of its dc gain. Simulation results indicate that active controllers based on the described method can systematically improve closed-loop performance with respect to other more common stochastic control approaches.

机译：本文提出了一种基于采样的隐式双重控制方法。隐式双重控制方法通过系统地近似Bellman的随机动态规划方程来综合随机控制策略，与显式双重控制方法相反，显式双重控制方法通过修改成本函数以包含奖励学习的术语来人为地引诱探索控制律。所提出的隐式双重控制方法是新颖的，因为它结合了粒子滤波器和策略迭代方法进行前向动态规划。两种方法的集成为问题提供了一个完整的基于采样的方法。通过使用表示为H块的特定体系结构，简化了该方法的实现。给出了减少实时应用中H块内计算负荷的实用建议。作为示例，该方法被应用于具有未知质量，长度，初始位置和速度以及其dc增益的未知符号的随机摆模型的控制。仿真结果表明，相对于其他更常见的随机控制方法，基于所述方法的主动控制器可以系统地提高闭环性能。

著录项

来源
《International Journal of Adaptive Control and Signal Processing》 |2010年第3期|155-177|共23页
作者
David S. Bayard; Alan Schumitzky;
展开▼
作者单位

Laboratory of Applied Pharmacokinetics. School of Medicine, University of Southern California, 2250 Alcazar St. CSC 134-B, Los Angeles. CA 90033. U.S.A. Jet Propulsion Laboratory, MS 198-326, 4800 Oak Grove Drive, CA 91109, U.S.A.;

Laboratory of Applied Pharmacokinetics. School of Medicine, University of Southern California, 2250 Alcazar St. CSC 134-B, Los Angeles. CA 90033. U.S.A. Mathematics Department. University of Southern California. Los Angeles. CA 90089. U.S.A.;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);美国《生物学医学文摘》(MEDLINE);
原文格式 PDF
正文语种 eng
中图分类
关键词
implicit dual control; particle filtering; policy iteration; stochastic optimal control; dynamic programming;

机译：隐式双重控制;粒子过滤政策迭代;随机最优控制;动态编程;

相似文献

外文文献
中文文献
专利

1. Residual life prediction based on dynamic weighted Markov model and particle filtering [J] . Zhang Shuai, Zhang Yongxiang, Zhu Jieping Journal of Intelligent Manufacturing . 2018,第4期

机译：基于动态加权的马尔可夫模型和粒子滤波的残余寿命预测
2. Forward search algorithm based on dynamic programming for real-time adaptive traffic signal control [J] . Yin Biao, Dridi Mahjoub, El Moudni Abdellah Intelligent Transport Systems, IET . 2015,第7期

机译：基于动态规划的前向搜索算法实时自适应交通信号控制
3. Programming the composition of polymer blend particles for controlled immunity towards individual protein antigens [J] . Zhan Xi, Shen Hong Vaccine . 2015,第23期

机译：对聚合物共混物颗粒的组成进行编程，以控制对单个蛋白质抗原的免疫
4. LS-SVM based neural controller as optimized by particle swarm algorithm using dual heuristic dynamic programming [C] . Si-Yao Fu, Guo-Sheng Yang, Zeng-Guang Hou International Joint Conference on Neural Networks;IJCNN 2009 . 2009

机译：基于粒子群算法的双重启发式动态规划优化的基于LS-SVM的神经控制器
5. A control architecture for dynamic execution of robot tasks trained in real-time using particle filters. [D] . Stanhope, Austin. 2009

机译：一种控制架构，用于动态执行使用粒子过滤器实时训练的机器人任务。
6. IMPLICIT DUAL CONTROL BASED ON PARTICLE FILTERING AND FORWARD DYNAMIC PROGRAMMING [O] . David S. Bayard, Alan Schumitzky -1

机译：基于粒子滤波和前向动态规划的隐式双控制
7. Reply to comment by J. M. Albert on “On the numerical simulation of particle dynamics in the radiation belt. Part I: Implicit and semi-implicit schemes” and “On the numerical simulation of particle dynamics in the radiation belt. Part II: Procedure based [O] . E. Camporeale, G. L. Delzanno, S. Zaharia, 2013

机译：回复J. M.Albert关于“辐射带粒子动力学数值模拟”的回复。第一部分：隐式和半隐式方案“和”辐射带中粒子动力学数值模拟“和”。第二部分：基于程序
8. Reduced Order Model Based Feedback Control of Large-Scale Aeroelastic Simulations: Residual State Filter Model Reduction Compensation and Application to F-16 Dynamic Models [R] . Balas, M. J. , Fagley, C. 2008

机译：基于降阶模型的大型气动弹性模拟反馈控制：残余状态滤波器模型约简补偿及其在F-16动力学模型中的应用

Implicit dual control based on particle filtering and forward dynamic programming

摘要

著录项

相似文献

相关主题

期刊订阅