【24h】

Parallel Rollout for Online Solution of dec-Pomdps

机译:并行部署dec-Pomdps在线解决方案

获取原文
获取原文并翻译 | 示例

摘要

A major research challenge is presented by scalability of algorithms for solving decentralized pomdps because of their double exponential worst-case complexity for finite horizon problems. First algorithms have only been able to solve very small instances on very small horizons. One exception is the Memory-Bounded Dynamic Programming algorithm - an approximation technique that has proved efficient in handling same sized problems but on large horizons. In this paper, we propose an online algorithm that also approximates larger instances of finite horizon dec-pomdps based on the Rollout algorithm. To evaluate the effectiveness of this approach, we compare the presented approach to a recently proposed algorithm called memory bounded dynamic programming. Experimental results show that despite the very high complexity of dec-pomdps, the combination of Rollout techniques and estimation techniques performs well and leads to a significant improvement of existing approximation techniques.
机译:求解有限点问题的算法的可扩展性提出了一个主要的研究挑战,这是因为它们具有有限时域问题的双指数最坏情况复杂性。最初的算法只能在很小的范围内求解很小的实例。一个例外是有内存限制的动态编程算法,这是一种近似技术,已被证明可有效地处理相同大小的问题,但适用范围广。在本文中,我们提出了一种在线算法,该算法还基于Rollout算法近似了有限水平dec-pomdps的较大实例。为了评估这种方法的有效性,我们将提出的方法与最近提出的称为内存有界动态规划的算法进行了比较。实验结果表明,尽管dec-pomdps的复杂性非常高,但Rollout技术和估计技术的结合效果很好,并导致了现有近似技术的显着改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号