首页> 外文会议>IEEE International Conference on Robotics and Automation >DISCO: Double Likelihood-free Inference Stochastic Control
【24h】

DISCO: Double Likelihood-free Inference Stochastic Control

机译:DISCO:无双重似然推断随机控制

获取原文

摘要

Accurate simulation of complex physical systems enables the development, testing, and certification of control strategies before they are deployed into the real systems. As simulators become more advanced, the analytical tractability of the differential equations and associated numerical solvers incorporated in the simulations diminishes, making them difficult to analyse. A potential solution is the use of probabilistic inference to assess the uncertainty of the simulation parameters given real observations of the system. Unfortunately the likelihood function required for inference is generally expensive to compute or totally intractable. In this paper we propose to leverage the power of modern simulators and recent techniques in Bayesian statistics for likelihood-free inference to design a control framework that is efficient and robust with respect to the uncertainty over simulation parameters. The posterior distribution over simulation parameters is propagated through a potentially non-analytical model of the system with the unscented transform, and a variant of the information theoretical model predictive control. This approach provides a more efficient way to evaluate trajectory roll outs than Monte Carlo sampling, reducing the online computation burden. Experiments show that the controller proposed attained superior performance and robustness on classical control and robotics tasks when compared to models not accounting for the uncertainty over model parameters.
机译:复杂物理系统的准确仿真可以在将控制策略部署到实际系统中之前进行开发,测试和认证。随着仿真器的发展,仿真中包含的微分方程和相关数值解算器的分析可处理性逐渐降低,从而使其难以分析。一个潜在的解决方案是使用概率推理来评估给定系统的真实观测值的仿真参数的不确定性。不幸的是,推理所需的似然函数通常计算起来十分昂贵或完全难以处理。在本文中,我们建议利用贝叶斯统计中的现代仿真器和最新技术的功能进行无可能性推断,以针对仿真参数的不确定性设计高效且鲁棒的控制框架。仿真参数的后验分布通过具有无味变换的系统潜在的非分析模型和信息理论模型预测控制的变体传播。与蒙特卡洛采样相比,此方法提供了一种评估轨迹推出的更有效方法,从而减轻了在线计算负担。实验表明,与不考虑模型参数不确定性的模型相比,该控制器提出的经典控制和机器人任务性能和鲁棒性更高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号