首页> 美国政府科技报告 >Separable Markovian Decision Problems: The Linear Programming Method in the Multichain Case (Revised).
【24h】

Separable Markovian Decision Problems: The Linear Programming Method in the Multichain Case (Revised).

机译:可分马尔可夫决策问题:多链情形中的线性规划方法(修订版)。

获取原文

摘要

Separable Markovian decision problems have the property that for certain pairs (i,a) of a state i and an action a: (1) the immediate reward is the sum of terms due to the current state and action (r(ia) = s(i) + t(a)), (2) the transition probability depends only on the action and not on the state from which the transition occurs. The separable model was studied already in the late sixties. For the discounted case and the unichain undiscounted case, a reduced LP formulation was given, which involves a substantially smaller number of variables than in the LP formulation of a general Markov decision problem. It was unknown whether such an efficient formulation was also possible in the multichain case. The paper solves the problem: such an efficient formulation can be obtained. Some applications of separable models are also presented.

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号