...
首页> 外文期刊>Wireless Communications Letters, IEEE >Q-Learning Based Energy Management Policies for a Single Sensor Node with Finite Buffer
【24h】

Q-Learning Based Energy Management Policies for a Single Sensor Node with Finite Buffer

机译:具有有限缓冲的单个传感器节点基于Q学习的能量管理策略

获取原文
获取原文并翻译 | 示例
           

摘要

In this paper, we consider the problem of finding optimal energy management policies in the presence of energy harvesting sources to maximize network performance. We formulate this problem in the discounted cost Markov decision process framework and apply two reinforcement learning algorithms. Prior work obtains optimal policy in the case when the conversion function mapping energy to data transmitted is linear and provides heuristic policies in the case when the same is nonlinear. Our algorithms, however, provide optimal policies regardless of the form of the conversion function. Through simulations, our policies are seen to outperform those of in the nonlinear case.
机译:在本文中,我们考虑在存在能量收集源的情况下寻找最佳能量管理策略以最大化网络性能的问题。我们在折现成本马尔可夫决策过程框架中制定此问题,并应用两种强化学习算法。在将能量映射到所传输的数据的转换函数是线性的情况下,现有技术获得了最佳策略,而在非线性的情况下,则提供了启发式策略。但是,无论转换函数的形式如何,我们的算法都能提供最佳策略。通过仿真,我们的策略被认为优于非线性情况下的策略。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号