OU Jieyu, ZHANG Yi, XIN Rong, et al. Decentralized Coordinated Control Strategy for Power Quality in Low Voltage Distribution Networks Based on Multi-agent Deep Reinforcement Learning[J]. 2025, (21): 8308-8322.
OU Jieyu, ZHANG Yi, XIN Rong, et al. Decentralized Coordinated Control Strategy for Power Quality in Low Voltage Distribution Networks Based on Multi-agent Deep Reinforcement Learning[J]. 2025, (21): 8308-8322. DOI: 10.13334/j.0258-8013.pcsee.241754.
基于多智能体深度强化学习的低压配电网电能质量分散式协同调控策略
摘要
随着新型负荷与分布式光伏的广泛接入,低压配电网中电能质量问题日渐凸显。既有电能质量综合调控策略具有难以适应低压配电网通讯条件薄弱、设备计算能力有限与电能质量问题复杂多样等特征,无法满足低压配电网的电能质量治理需求。为此,该文提出一种基于多智能体深度强化学习的低压配电网电能质量分散式协同调控策略。首先,统筹兼顾低压配电网以上典型特征,将电能质量协同调控问题建立为分散式部分可观的马尔科夫决策过程;其次,为解决样本效率低下问题,设计改进深度密连强化学习(modified deep dense reinforcement learning,MD2RL)结构与MATD3-MD2RL算法;再次,引入集中式训练-分散式执行架构,在保证分散式观测的同时实现多智能体协同运行;最后,在改进IEEE-13节点系统与实际低压配电网中进行算例测试。结果表明,所提调控策略可有效改善低压配电网的电能质量,同时具有无通讯、实时性强且不依赖于精确潮流模型等优势。
Abstract
With the extensive integration of novel load demands and distributed PV generations
power quality issues are becoming more prevalent in low voltage distribution networks (LVDNs). Existing control strategies struggle to adapt to LVDNs
which features weak communication conditions
limited computational capacity
and complex power quality issues
thereby failing to meet the power quality needs in LVDNs. Therefore
this paper proposes a decentralized coordinated power quality control strategy for LVDNs based on multi-agent deep reinforcement learning (MADRL). First
considering the typical characteristics of LVDNs
the power quality coordinated control problem is transformed into a decentralized partially observable Markov decision process. Secondly
to address the low sample efficiency
the modified deep dense reinforcement learning (MD2RL) structure and the MATD3-MD2RL algorithm are designed. Then
a centralized training-decentralized execution framework is introduced to ensure decentralized observations while achieving multi-agent coordination. Finally
case studies are conducted on the IEEE 13-bus system and a practical LVDN. The results demonstrate that the proposed control strategy improves the power quality of LVDNs with additional advantages such as independence from communication