首页> 外文期刊>International Journal of Adaptive Control and Signal Processing >Implicit dual control based on particle filtering and forward dynamic programming
【24h】

Implicit dual control based on particle filtering and forward dynamic programming

机译:基于粒子滤波和前向动态规划的隐式双重控制

获取原文
获取原文并翻译 | 示例
           

摘要

This paper develops a sampling-based approach to implicit dual control. Implicit dual control methods synthesize stochastic control policies by systematically approximating the stochastic dynamic programming equations of Bellman, in contrast to explicit dual control methods that artificially induce probing into the control law by modifying the cost function to include a term that rewards learning. The proposed implicit dual control approach is novel in that it combines a particle filter with a policy-iteration method for forward dynamic programming. The integration of the two methods provides a complete sampling-based approach to the problem. Implementation of the approach is simplified by making use of a specific architecture denoted as a H-block. Practical suggestions are given for reducing computational loads within the H-block for real-time applications. As an example, the method is applied to the control of a stochastic pendulum model having unknown mass, length, initial position and velocity, and unknown sign of its dc gain. Simulation results indicate that active controllers based on the described method can systematically improve closed-loop performance with respect to other more common stochastic control approaches.
机译:本文提出了一种基于采样的隐式双重控制方法。隐式双重控制方法通过系统地近似Bellman的随机动态规划方程来综合随机控制策略,与显式双重控制方法相反,显式双重控制方法通过修改成本函数以包含奖励学习的术语来人为地引诱探索控制律。所提出的隐式双重控制方法是新颖的,因为它结合了粒子滤波器和策略迭代方法进行前向动态规划。两种方法的集成为问题提供了一个完整的基于采样的方法。通过使用表示为H块的特定体系结构,简化了该方法的实现。给出了减少实时应用中H块内计算负荷的实用建议。作为示例,该方法被应用于具有未知质量,长度,初始位置和速度以及其dc增益的未知符号的随机摆模型的控制。仿真结果表明,相对于其他更常见的随机控制方法,基于所述方法的主动控制器可以系统地提高闭环性能。

著录项

  • 来源
  • 作者单位

    Laboratory of Applied Pharmacokinetics. School of Medicine, University of Southern California, 2250 Alcazar St. CSC 134-B, Los Angeles. CA 90033. U.S.A. Jet Propulsion Laboratory, MS 198-326, 4800 Oak Grove Drive, CA 91109, U.S.A.;

    Laboratory of Applied Pharmacokinetics. School of Medicine, University of Southern California, 2250 Alcazar St. CSC 134-B, Los Angeles. CA 90033. U.S.A. Mathematics Department. University of Southern California. Los Angeles. CA 90089. U.S.A.;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);美国《生物学医学文摘》(MEDLINE);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    implicit dual control; particle filtering; policy iteration; stochastic optimal control; dynamic programming;

    机译:隐式双重控制;粒子过滤政策迭代;随机最优控制;动态编程;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号