基于改进双延迟深度确定性策略梯度算法的微电网低碳经济优化调度策略

宋金威; 何悠悠; 孙大帅; 董臣臣; 王景龙

doi:10.19725/j.cnki.1007-2322.2024.0063

基于改进双延迟深度确定性策略梯度算法的微电网低碳经济优化调度策略

A Low-carbon Economic Optimization Dispatching Strategy for Microgrids Based on Improved Twin Delayed Deep Deterministic Policy Gradient Algorithm

摘要

摘要: 随着中国“双碳”战略的大力实施，新能源逐渐进入公众视野，微电网的出现有利于新能源的快速消纳，微电网调度与优化问题也逐渐成为研究热点。采用传统方法进行微电网低碳经济调度会出现数据维度高、建模难度大、灵活性低、运行时间长等问题。针对这些问题，提出一种基于改进双延迟深度确定性策略梯度算法(twin delayed deep deterministic policy gradient，TD3)的微电网低碳经济优化调度策略。首先，对模型预测控制(model predictive control，MPC)的策略结果进行统计分析，将得出的储能电池运行规律用于TD3算法自学习规则的改进，有效缩减模型探索区域；其次，引入双缓存区技术，改善样本失衡问题，实现效率和鲁棒性的双重提升；最后，通过对某示范微电网的算例进行分析，验证所提算法模型的有效性。

Abstract: With the vigorous implementation of China’s carbon peaking and carbon neutrality goals, renewable energy has gradually come into the public spotlight. The emergence of microgrids is conducive to the rapid absorption of renewable energy, making microgrid dispatching and optimization a research focal point. Traditional methods for low-carbon economic dispatching of microgrids encounter various challenges such as high data dimensionality, complex modeling, limited flexibility, and extended operation times. To address these challenges, a low-carbon economic optimization dispatching strategy for microgrids is proposed based on an improved twin delayed deep deterministic (TD3) policy gradient algorithm. Firstly, a statistical analysis on the strategy results of model predictive control (MPC) is conducted. The obtained operation rules of energy storage batteries are utilized to refine the self-learning rules of the TD3 algorithm, thereby effectively reducing the model’s exploration area. Secondly, the dual buffer technique is introduced to address the issue of sample imbalance, achieving a dual enhancement in efficiency and robustness. Finally, the effectiveness of the proposed algorithm model is verified through the analysis of a case study of a demonstration microgrid.

HTML全文

参考文献(22)

施引文献

资源附件(0)