...
首页> 外文期刊>Automatic Control, IEEE Transactions on >Sufficient Conditions for the Value Function and Optimal Strategy to be Even and Quasi-Convex
【24h】

Sufficient Conditions for the Value Function and Optimal Strategy to be Even and Quasi-Convex

机译:值函数的充分条件和最优策略为拟凸的

获取原文
获取原文并翻译 | 示例
           

摘要

Sufficient conditions are identified under which the value function and the optimal strategy of a Markov decision process (MDP) are even and quasi-convex in the state. The key idea behind these conditions is the following. First, sufficient conditions for the value function and optimal strategy to be even are identified. Next, it is shown that if the value function and optimal strategy are even, then one can construct a “folded MDP” defined only on the nonnegative values of the state space. Then, the standard sufficient conditions for the value function and optimal strategy to be monotone are “unfolded” to identify sufficient conditions for the value function and the optimal strategy to be quasi-convex. The results are illustrated by using an example of power allocation in remote estimation.
机译:确定了充分条件,在该条件下,状态函数的价值函数和最优策略(马尔科夫决策过程)是均匀且近似凸的。这些条件背后的关键思想如下。首先,确定使价值函数和最优策略均等的充分条件。接下来,表明如果值函数和最优策略是偶数,则可以构造仅在状态空间的非负值上定义的“折叠MDP”。然后,“展开”用于价值函数和最优策略为单调的标准充分条件,以识别用于价值函数和最优策略为准凸的充分条件。通过使用远程估计中的功率分配示例来说明结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号