杨挺, 赵黎媛, 刘亚闯, 冯少康, 盆海波. 基于深度强化学习的综合能源系统动态经济调度[J]. 电力系统自动化, 2021, 45(5): 39-47.
引用本文: 杨挺, 赵黎媛, 刘亚闯, 冯少康, 盆海波. 基于深度强化学习的综合能源系统动态经济调度[J]. 电力系统自动化, 2021, 45(5): 39-47.
YANG Ting, ZHAO Liyuan, LIU Yachuang, FENG Shaokang, PEN Haibo. Dynamic Economic Dispatch for Integrated Energy System Based on Deep Reinforcement Learning[J]. Automation of Electric Power Systems, 2021, 45(5): 39-47.
Citation: YANG Ting, ZHAO Liyuan, LIU Yachuang, FENG Shaokang, PEN Haibo. Dynamic Economic Dispatch for Integrated Energy System Based on Deep Reinforcement Learning[J]. Automation of Electric Power Systems, 2021, 45(5): 39-47.

基于深度强化学习的综合能源系统动态经济调度

Dynamic Economic Dispatch for Integrated Energy System Based on Deep Reinforcement Learning

  • 摘要: 综合能源系统的优化调度对于实现系统的多能互补和经济运行具有重要意义。然而,系统中可再生能源的间歇性以及用户用能需求的不确定性造成了系统中供需双方的随机波动,传统的调度方法难以准确地适应实际环境的动态变化。针对这一问题,提出了一种考虑可再生能源和负荷时变特性的综合能源系统动态经济调度方法。首先对综合能源系统动态经济调度问题进行数学描述,然后将该调度决策问题表述为强化学习框架,定义了系统的观测状态、调度动作和奖励函数,继而采用深度确定性策略梯度算法进行连续状态和动作空间下的动态调度决策。所提方法不需要对不确定性进行预测或建模,能够动态地对源和荷的随机波动做出响应。最后通过算例仿真验证了所提方法的有效性。

     

    Abstract: The optimal dispatch of integrated energy systems is of great significance for the realization of multi-energy complementary and economic operation of the system. However, the intermittence of renewable energy and the uncertainty of users’ energy demands in the system cause the random fluctuation on both the supply and demand sides in the system. Traditional dispatch methods are difficult to adapt to the dynamic changes of the actual environment accurately. In accordance to this problem,a dynamic economic dispatch method for integrated energy systems considering the time-varying characteristics of renewable energy and heterogeneous loads is proposed. Firstly, the dynamic economic dispatch problem for integrated energy systems is described mathematically. Secondly, the dispatch decision problem is formulated as a reinforcement learning framework, in which the observation state, dispatch action and reward function of the system are defined. Then, deep deterministic policy gradient(DDPG) algorithm is used to make dynamic dispatch decisions in continuous state and action spaces. The proposed method does not need to predict or model the uncertainty, and can dynamically respond to the random fluctuations of the source and loads.Finally, simulation is carried out to demonstrate the effectiveness of the proposed method.

     

/

返回文章
返回