...
首页> 外文期刊>IEEE Transactions on Systems, Man, and Cybernetics >Model-Based and Learning-Based Decision Making in Incomplete Information Cournot Games: A State Estimation Approach
【24h】

Model-Based and Learning-Based Decision Making in Incomplete Information Cournot Games: A State Estimation Approach

机译:不完全信息古诺游戏中基于模型和基于学习的决策:一种状态估计方法

获取原文
获取原文并翻译 | 示例
           

摘要

In an incomplete information game, a big challenge is to find the best way of exploiting available information for optimal decision making of the agents. In this paper, two decision making methods, namely model-based and learning-based bidding strategies, are proposed and compared, for repeated Cournot competition of the generators in a day-ahead electricity market. The sum of the rivals’ offered quantities (SROQ) is considered as the state of the agent and its value is estimated using an adaptive expectation method. In the model-based approach, the convergence of the agents’ strategies to the Nash equilibrium point is also studied in two different cases. In the learning-based approach, the optimal bidding strategy is learned through combination of state estimation and a reinforcement learning method. Using the estimated state (SROQ), the optimal decision is learned through a fuzzy Q-learning algorithm. Through a case study, which is performed on the three-bus benchmark Cournot model, the convergence of the generators’ bids to the Nash-Cournot equilibrium is examined.
机译:在一个不完整的信息游戏中,最大的挑战是找到利用可用信息以最佳决策代理商的最佳方法。本文提出并比较了两种决策方法,即基于模型的出价策略和基于学习的出价策略,以在日前电力市场中反复进行发电机的古诺竞争。竞争对手提供的数量(SROQ)的总和被视为代理商的状态,其价值使用自适应期望方法进行估算。在基于模型的方法中,还研究了两种不同情况下代理商策略向Nash平衡点的收敛。在基于学习的方法中,通过状态估计和强化学习方法的组合来学习最佳投标策略。使用估计状态(SROQ),可通过模糊Q学习算法学习最佳决策。通过在三总线基准古诺模型上进行的案例研究,研究了发电商出价对纳什古诺均衡的收敛性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号