基于双Q学习的考虑暂态稳定约束的电网薄弱线路辨识
Double Q-learning Based Identification of Weak Lines in Power Grid Considering Transient Stability Constraints
-
摘要: 为了辨识故障后对电网暂态稳定性影响较大的薄弱线路,提出一种基于双Q学习(doubleQ-learning,DQL)的考虑暂态稳定约束的电网薄弱线路辨识方法。首先,将电网薄弱线路辨识问题转化成马尔克夫决策过程。接着,利用DQL智能体结合电网时域暂态仿真计算,通过强化学习筛选出容易导致电网失稳的切线故障。最后,提出线路薄弱度指标,计算得到考虑暂态稳定约束的电网薄弱线路。所采用的DQL在Q学习的基础上增加了Q目标表,实现切线故障选择与评估的分离,并采用优先级采样的经验回放机制,能提升算法的稳定性与训练速度。10机39节点和16机68节点系统的仿真结果都表明,所提基于DQL算法的薄弱线路辨识方法,能通过较少的仿真次数,有效地辨识出考虑系统暂态安全稳定约束的薄弱线路。Abstract: This paper proposes a novel identification method of weak lines in power grid based on double Q-learning(DQL) algorithm. Firstly, the Markov decision process was formulated for the weak lines identification problem considering the power grid transient stability. Then, the DQL was adopted to decide the line fault that most likely lead to the transient instability through the interaction with power grid transient simulation. Finally, a line weakness index based on the Q values was proposed to figure out the weak lines which have great influence on the transient security and stability of power grid. In order to improve the stability and training speed of the algorithm, the Q target table was introduced to DQL to separate the selection and assessment of line fault, and the prioritized experience replay was also adopted. The simulation results of 10-machine-39-bus system and 16-machine-68-bus system showed that the proposed weak line identification method based on DQL algorithm can effectively identify the weak lines of power system considering the transient stability constraint with fewer simulation times.