基于多代理强化学习的虚拟电厂参与调峰市场的优化策略

祁兵; 彭浩宇; 李彬; 张恒源; 祁书珩

doi:10.19725/j.cnki.1007-2322.2024.0123

基于多代理强化学习的虚拟电厂参与调峰市场的优化策略

Optimization Strategy for Virtual Power Plants Participating in Peak Shaving Market Based on Multi-agent Reinforcement Learning

摘要

摘要: 为解决虚拟电厂中多可调节资源间因信息不完全及竞争制约关系导致的出力决策困难问题，提出一种基于多代理强化学习与贝叶斯博弈的虚拟电厂多主体协同优化策略。针对不同资源的特性差异，构建包含多类可调节资源的虚拟电厂模型。针对虚拟电厂参与调峰市场的优化调度问题，构建基于环境-经济协同目标的贝叶斯博弈模型，优化各资源主体的出力策略，并证明该博弈模型纳什均衡解的存在性。最后，在不完全信息市场环境下，采用多代理强化学习算法联合贝叶斯博弈对上述模型进行求解，得出虚拟电厂多目标下多主体的优化调度方案。仿真结果显示，在非全信息可调控资源的博弈模型下，所得虚拟电厂优化调度方案可有效提升经济效益并减少碳排放，在经济与环境多目标下实现双赢。

Abstract: To address the challenge of service selection due to the incomplete information, such as the participation of multi-adjustable resources in the virtual power plant and the competition restriction relationship, a multi-agent game strategy for virtual power plants is proposed based on multi-agent reinforcement learning and Bayesian game. Firstly, a virtual power plant model containing multiple adjustable resources is established according to different resource characteristics. Secondly, a Bayesian game model for the optimal dispatch of virtual power plants participating in the peak regulation market is established, considering environment and economy coordination. The output of virtual power plant resources participating in peak regulation market is optimized, thereby confirming the existence of a Nash equilibrium solution. Finally, in the incomplete information market environment, the multi-agent reinforcement learning algorithm combined with Bayesian game is used to solve the above model, and the multi-agent optimal dispatching scheme for virtual power plants under multi-objective can be obtained. Simulation analysis indicates that, under the game model of controllable resources with incomplete information, the optimal dispatching scheme for virtual power plants can effectively improve economic efficiency and reduce carbon emissions. This strategy achieves a win-win outcome in terms of both economic and environmental multi-objective optimization.

HTML全文

参考文献(31)

施引文献

资源附件(0)