Citation: LIU Jianshu, JIANG Yuewen. Joint Optimal Operation of Multi Wind-Hydrogen System Based on Multi-Agent Reinforcement Learning[J]. Modern Electric Power, 2022, 39(4): 431-440. DOI: 10.19725/j.cnki.1007-2322.2021.0197

Joint Optimal Operation of Multi Wind-Hydrogen System Based on Multi-Agent Reinforcement Learning

  • Abstract: To address the joint operation of multiple wind-hydrogen systems, a joint optimal operation method based on multi-agent reinforcement learning is proposed, so that the systems accommodate wind power effectively while maximizing their joint revenue. First, wind farms and hydrogen production and refueling stations are assumed to operate jointly through contract transactions, and an operation model is constructed for each. Second, a joint optimal operation model is established with the objective of maximizing the joint operating revenue of the multiple wind-hydrogen systems. Third, to cope with the curse of dimensionality caused by the many decision variables of the multiple wind-hydrogen systems, multiple agents are introduced into reinforcement learning, and a multi-decision update method is adopted to accelerate convergence of the algorithm. Finally, simulation case studies verify the rationality of the proposed model and the feasibility of the method.
