...
首页> 外文期刊>The Journal of Artificial Intelligence Research >First Order Decision Diagrams for Relational MDPs
【24h】

First Order Decision Diagrams for Relational MDPs

机译:关系MDP的一阶决策图

获取原文
获取原文并翻译 | 示例
           

摘要

Markov decision processes capture sequential decision making under uncertainty, where an agent must choose actions so as to optimize long term reward. The paper studies efficient reasoning mechanisms for Relational Markov Decision Processes (RMDP) where world states have an internal relational structure that can be naturally described in terms of objects and relations among them. Two contributions are presented. First, the paper develops First Order Decision Diagrams (FODD), a new compact representation for functions over relational structures, together with a set of operators to combine FODDs, and novel reduction techniques to keep the representation small. Second, the paper shows how FODDs can be used to develop solutions for RMDPs, where reasoning is performed at the abstract level and the resulting optimal policy is independent of domain size (number of objects) or instantiation. In particular, a variant of the value iteration algorithm is developed by using special operations over FODDs, and the algorithm is shown to converge to the optimal policy.
机译:马尔可夫决策过程可捕捉不确定性下的顺序决策,代理商必须选择行动以优化长期回报。本文研究了关系马尔可夫决策过程(RMDP)的有效推理机制,其中世界状态具有内部关系结构,可以根据对象及其之间的关系自然地对其进行描述。提出了两个贡献。首先,本文开发了“一阶决策图”(FODD),关系结构上函数的新紧凑表示形式,以及用于组合FODD的一组运算符以及新颖的归约技术,以使表示形式变小。其次,本文展示了如何将FODD用于开发RMDP的解决方案,其中在抽象级别执行推理,并且所得到的最优策略与域大小(对象数)或实例化无关。特别是,通过使用FODD上的特殊运算来开发了值迭代算法的一种变体,并且该算法被证明收敛于最优策略。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号