Communication-Based Decomposition Mechanisms for Decentralized MDPs

Claudia V. Goldman; Shlomo Zilberstein

首页> 外文期刊>The Journal of Artificial Intelligence Research >Communication-Based Decomposition Mechanisms for Decentralized MDPs

【24h】

Communication-Based Decomposition Mechanisms for Decentralized MDPs

机译：分散MDP的基于通信的分解机制

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Multi-agent planning in stochastic environments can be framed formally as a decentralized Markov decision problem. Many real-life distributed problems that arise in manufacturing, multi-robot coordination and information gathering scenarios can be formalized using this framework. However, finding the optimal solution in the general case is hard, limiting the applicability of recently developed algorithms. This paper provides a practical approach for solving decentralized control problems when communication among the decision makers is possible, but costly. We develop the notion of communication-based mechanism that allows us to decompose a decentralized MDP into multiple single-agent problems. In this framework, referred to as decentralized semi-Markov decision process with direct communication (Dec-SMDP-Com), agents operate separately between communications. We show that finding an optimal mechanism is equivalent to solving optimally a Dec-SMDP-Com. We also provide a heuristic search algorithm that converges on the optimal decomposition. Restricting the decomposition to some specific types of local behaviors reduces significantly the complexity of planning. In particular, we present a polynomial- time algorithm for the case in which individual agents perform goal-oriented behaviors between communications. The paper concludes with an additional tractable algorithm that enables the introduction of human knowledge, thereby reducing the overall problem to finding the best time to communicate. Empirical results show that these approaches provide good approximate solutions.

机译：可以将随机环境中的多主体规划正式定义为分散的马尔可夫决策问题。使用此框架可以将制造，多机器人协调和信息收集场景中出现的许多现实生活中的分布式问题正式化。但是，在一般情况下很难找到最佳解决方案，这限制了最近开发的算法的适用性。当决策者之间可以进行通讯但成本很高时，本文提供了一种解决分散控制问题的实用方法。我们开发了基于通信的机制的概念，该概念使我们可以将分散的MDP分解为多个单主体问题。在此框架中，称为直接通信的分散式半马尔可夫决策过程（Dec-SMDP-Com），代理在通信之间分别进行操作。我们表明，找到最优机制等同于最优地解决Dec-SMDP-Com。我们还提供了一种收敛于最优分解的启发式搜索算法。将分解限制在某些特定类型的本地行为中，可以大大降低计划的复杂性。特别是，对于单个代理在通信之间执行面向目标的行为的情况，我们提出了多项式时间算法。本文以附加的易处理算法作为结束语，该算法能够引入人类知识，从而减少总体问题，从而找到最佳的交流时间。实证结果表明，这些方法提供了良好的近似解决方案。

著录项

来源
《The Journal of Artificial Intelligence Research》 |2008年第0期|共34页
作者
Claudia V. Goldman; Shlomo Zilberstein;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类人工智能理论;
关键词

相似文献

外文文献
中文文献
专利

1. Communication-Based Decomposition Mechanisms for Decentralized MDPs [J] . Goldman C. V., Zilberstein S. The Journal of Artificial Intelligence Research . 2008,第12期

机译：分散MDP的基于通信的分解机制
2. Communication-Based Decomposition Mechanisms for Decentralized MDPs [J] . Claudia V. Goldman, Shlomo Zilberstein The Journal of Artificial Intelligence Research . 2008,第0期

机译：分散MDP的基于通信的分解机制
3. Communication-Based Decomposition Mechanisms for Decentralized MDPs [J] . C. V. Goldman, S. Zilberstein Journal of Automation, Mobile Robotics & Intelligent Systems . 2008,第1期

机译：分散MDP的基于通信的分解机制
4. An Adaptive Dissemination Mechanism for Inter-Vehicle Communication-Based Decentralized Traffic Information Systems [C] . Huaying Xu, Matthew Barth IEEE Intelligent Transportation Systems Conference . 2006

机译：基于车间通信的分散交通信息系统的自适应传播机制
5. Decomposition and decentralized output control of large-scale systems. [D] . Finney, John Dennis. 1995

机译：大型系统的分解和分散输出控制。
6. Modeling and Planning with Macro-Actions in Decentralized POMDPs [O] . Christopher Amato, George Konidaris, Leslie P. Kaelbling, -1

机译：在分散的POMDP中使用宏动作进行建模和计划
7. Communication-Based Decomposition Mechanisms for Decentralized MDPs [O] . Goldman, Cluadia V., Zilberstein, Shlomo 2011

机译：基于通信的分散式mDp分解机制

Communication-Based Decomposition Mechanisms for Decentralized MDPs

摘要

著录项

相似文献

相关主题

期刊订阅