...
首页> 外文期刊>IEEE Robotics and Automation Letters >Practical Resolution Methods for MDPs in Robotics Exemplified With Disassembly Planning
【24h】

Practical Resolution Methods for MDPs in Robotics Exemplified With Disassembly Planning

机译:机器人中MDP的实用解决方法,以拆卸计划为例

获取原文
获取原文并翻译 | 示例
           

摘要

In this letter, we focus on finding practical resolution methods for Markov decision processes (MDPs) in robotics. Some of the main difficulties of applying MDPs to real-world robotics problems are: first, having to deal with huge state spaces; and second, designing a method that is robust enough to dead ends. These complications restrict or make more difficult the application of methods, such as value iteration, policy iteration, or labeled real-time dynamic programming (LRTDP). We see in determinization and heuristic search a way to successfully work around these problems. In addition, we believe that many practical use cases offer the opportunity to identify hierarchies of subtasks and solve smaller, simplified problems. We propose a decision-making unit that operates in a probabilistic planning setting through stochastic shortest path problems, which generalize the most common types of MDPs. Our decision-making unit combines: first, automatic hierarchical organization of subtasks; and second, on-line resolution via determinization. We argue that several applications of planning benefit from these two strategies. We exemplify our approach with a robotized disassembly application. The disassembly problem is modeled in probabilistic planning definition language, and serves to define our experiments. Our results show many advantages of our method over LRTDP, such as a better capability to handle problems with large state spaces and state definitions that change when new fluents are discovered.
机译:在这封信中,我们专注于为机器人技术中的马尔可夫决策过程(MDP)寻找实用的解决方法。将MDP应用于实际机器人问题的一些主要困难是:首先,必须处理巨大的状态空间;其次,设计一种足够健壮至死胡同的方法。这些复杂性限制或增加了方法的应用,例如值迭代,策略迭代或标记的实时动态编程(LRTDP)。我们在确定性和启发式搜索中看到了一种成功解决这些问题的方法。此外,我们认为许多实际用例为识别子任务的层次结构和解决较小的简化问题提供了机会。我们建议一个决策单位,通过随机的最短路径问题在概率计划中进行操作,该问题概括了最常见的MDP类型。我们的决策部门结合了:首先,自动执行子任务的分层组织;第二,通过确定性进行在线解析。我们认为,规划的几种应用都受益于这两种策略。我们以机器人拆卸应用为例来说明我们的方法。拆卸问题是用概率规划定义语言建模的,可用来定义我们的实验。我们的结果表明,与LRTDP相比,我们的方法具有许多优势,例如,具有更好的能力来处理具有较大状态空间的问题,并且在发现新流利液时状态定义会发生变化。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号