首页> 外文会议>Foundations of software science and computational structures >The Complexity of Graph-Based Reductions for Reachability in Markov Decision Processes
【24h】

The Complexity of Graph-Based Reductions for Reachability in Markov Decision Processes

机译:马尔可夫决策过程中基于图的约简复杂性

获取原文
获取原文并翻译 | 示例

摘要

We study the never-worse relation (NWR) for Markov decision processes with an infinite-horizon reachability objective. A state q is never worse than a state p if the maximal probability of reaching the target set of states from p is at most the same value from q, regardless of the probabilities labelling the transitions. Extremal-probability states, end components, and essential states are all special cases of the equivalence relation induced by the NWR. Using the NWR, states in the same equivalence class can be collapsed. Then, actions leading to sub-optimal states can be removed. We show that the natural decision problem associated to computing the NWR is CONP-complete. Finally, we extend a previously known incomplete polynomial-time iterative algorithm to under-approximate the NWR.
机译:我们研究了具有无限水平可达性目标的马尔可夫决策过程的永无忧虑关系(NWR)。如果从p到达状态的目标状态集的最大概率与从q到来的值最大相同,则状态q永远不会比状态p差,而与标记过渡的概率无关。极值概率状态,最终成分和基本状态都是NWR引起的等价关系的特殊情况。使用NWR,可以折叠相同等效类中的状态。然后,可以删除导致次优状态的动作。我们表明,与计算NWR相关的自然决策问题是CONP完全的。最后,我们扩展了先前已知的不完全多项式时间迭代算法,以使NWR近似不足。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号