IEEE Global Communications Conference

Multi-Agent Reinforcement Learning Enabling Dynamic Pricing Policy for Charging Station Operators



Abstract

The development of plug-in electric vehicles (PEVs) brings lucrative opportunities for charging station operators (CSOs). To attract more CSOs to the PEV market, a reasonable pricing policy is of great importance. However, dynamic environments and the uncertain behavior of competitors make the pricing problem challenging for CSOs. In this paper, we focus on a dynamic pricing policy that maximizes the long-term profits of CSOs. First, we propose a hierarchical framework to describe the economic relationships in the PEV market, which consists, from top to bottom, of the smart grid, CSOs, and charging stations (CSs) serving PEVs. Next, we model the CSO layer as a competitive market using a Markov game. Finally, we design a dynamic pricing policy algorithm (DPPA) based on multi-agent reinforcement learning to achieve higher long-term profits for CSOs. Experiments based on real PEV data from Beijing show that DPPA yields a significant improvement in the long-term profit of CSOs, and the improvement grows over time. Moreover, DPPA effectively reduces the profit loss of CSOs as more competitors enter the market.
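The abstract describes DPPA only at a high level. As a rough illustration of the general idea it names (competing operators learning prices via multi-agent reinforcement learning in a Markov game), the following minimal Python sketch uses independent tabular Q-learning with a toy price-sensitive demand model. The price grid, wholesale cost, demand split, and two-agent setup are hypothetical placeholders, not the authors' DPPA, model, or Beijing data.

```python
"""Illustrative sketch (not the paper's DPPA): two competing CSO agents
repeatedly choose charging prices via independent tabular Q-learning.
All numbers and the demand model below are hypothetical placeholders."""
import random
from collections import defaultdict

PRICES = [0.8, 1.0, 1.2, 1.5]   # candidate charging prices (currency/kWh), hypothetical
WHOLESALE = 0.6                 # assumed electricity cost paid to the smart grid
TOTAL_DEMAND = 100.0            # total PEV charging demand per period, hypothetical

def split_demand(prices):
    """Toy price-sensitive split: cheaper CSOs attract proportionally more PEVs."""
    weights = [1.0 / p for p in prices]
    total = sum(weights)
    return [TOTAL_DEMAND * w / total for w in weights]

def profits(prices):
    """Per-CSO profit: (retail price - wholesale cost) * served demand."""
    return [(p - WHOLESALE) * d for p, d in zip(prices, split_demand(prices))]

class Agent:
    """One CSO using epsilon-greedy Q-learning over the joint last-price state."""
    def __init__(self, alpha=0.1, gamma=0.95, eps=0.1):
        self.q = defaultdict(float)
        self.alpha, self.gamma, self.eps = alpha, gamma, eps

    def act(self, state):
        if random.random() < self.eps:
            return random.randrange(len(PRICES))
        return max(range(len(PRICES)), key=lambda a: self.q[(state, a)])

    def update(self, state, action, reward, next_state):
        best_next = max(self.q[(next_state, a)] for a in range(len(PRICES)))
        td_target = reward + self.gamma * best_next
        self.q[(state, action)] += self.alpha * (td_target - self.q[(state, action)])

def train(episodes=5000):
    agents = [Agent(), Agent()]
    state = (0, 0)  # indices of the prices chosen in the previous period
    for _ in range(episodes):
        actions = [ag.act(state) for ag in agents]
        rewards = profits([PRICES[a] for a in actions])
        next_state = tuple(actions)
        for ag, a, r in zip(agents, actions, rewards):
            ag.update(state, a, r, next_state)
        state = next_state
    # report each agent's greedy price at the final state
    greedy = [max(range(len(PRICES)), key=lambda a: ag.q[(state, a)]) for ag in agents]
    return [PRICES[a] for a in greedy]

if __name__ == "__main__":
    random.seed(0)
    print("learned prices:", train())
```

In the paper's setting, the state, transitions, and rewards would come from the Markov game over CSOs driven by real PEV charging demand; the sketch only mirrors the repeated competitive-pricing structure that the abstract outlines.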

