Lei Xi, Lipeng Zhou, Lang Liu, 等. A deep reinforcement learning algorithm for the power order optimization allocation of AGC in interconnected power grids[J]. 中国电机工程学会电力与能源系统学报(英文), 2020,6(3):712-723.
Lei Xi, Lipeng Zhou, Lang Liu, et al. A deep reinforcement learning algorithm for the power order optimization allocation of AGC in interconnected power grids[J]. CSEE Journal of Power and Energy Systems, 2020, 6(3): 712-723.
Lei Xi, Lipeng Zhou, Lang Liu, 等. A deep reinforcement learning algorithm for the power order optimization allocation of AGC in interconnected power grids[J]. 中国电机工程学会电力与能源系统学报(英文), 2020,6(3):712-723. DOI: 10.17775/CSEEJPES.2019.01840.
Lei Xi, Lipeng Zhou, Lang Liu, et al. A deep reinforcement learning algorithm for the power order optimization allocation of AGC in interconnected power grids[J]. CSEE Journal of Power and Energy Systems, 2020, 6(3): 712-723. DOI: 10.17775/CSEEJPES.2019.01840.
A deep reinforcement learning algorithm for the power order optimization allocation of AGC in interconnected power grids
The integration of distributed generations (solar power
wind power)
energy storage devices
and electric vehicles
causes unpredictable disturbances in power grids. It has become a top priority to coordinate the distributed generations
loads
and energy storages in order to better facilitate the utilization of new energy. Therefore
a novel algorithm based on deep reinforcement learning
namely the deep PDWoLF-PHC (policy dynamics based win or learn fast-policy hill climbing) network (DPDPN)
is proposed to allocate power order among the various generators. The proposed algorithm combines the decision mechanism of reinforcement learning with the prediction mechanism of a deep neural network to obtain the optimal coordinated control for the source-grid-load. Consequently it solves the problem brought by stochastic disturbances and improves the utilization rate of new energy. Simulations are conducted with the case of the improved IEEE two-area and a case in the Guangdong power grid. Results show that the adaptability and control performance of the power system are improved using the proposed algorithm as compared with using other existing strategies.