石文喆, 李冰洁, 尤培培, 张泠. 基于深度强化学习的建筑能源系统优化策略[J]. 中国电力, 2023, 56(6): 114-122. DOI: 10.11930/j.issn.1004-9649.202210043
引用本文: 石文喆, 李冰洁, 尤培培, 张泠. 基于深度强化学习的建筑能源系统优化策略[J]. 中国电力, 2023, 56(6): 114-122. DOI: 10.11930/j.issn.1004-9649.202210043
SHI Wenzhe, LI Bingjie, YOU Peipei, ZHANG Ling. Optimization Strategy of Building Energy System Based on Deep Reinforcement Learning[J]. Electric Power, 2023, 56(6): 114-122. DOI: 10.11930/j.issn.1004-9649.202210043
Citation: SHI Wenzhe, LI Bingjie, YOU Peipei, ZHANG Ling. Optimization Strategy of Building Energy System Based on Deep Reinforcement Learning[J]. Electric Power, 2023, 56(6): 114-122. DOI: 10.11930/j.issn.1004-9649.202210043

基于深度强化学习的建筑能源系统优化策略

Optimization Strategy of Building Energy System Based on Deep Reinforcement Learning

  • 摘要: 针对建筑能源系统中需求侧的负荷不确定性与供给侧的可再生能源随机性,提出一种基于深度强化学习的建筑能源系统管理优化策略。首先,搭建能源系统供需侧研究框架并建立设备模型。然后,将实时阶段下的建筑能源管理问题构建为马尔可夫决策过程,利用深度强化学习理论,以最小化用电成本、保证室内热舒适水平和最大化消纳可再生能源为优化目标,采用决斗双重深度Q网络算法进行训练,得到训练后的算法可以根据实时环境参数做出自适应控制决策。最后,通过在建筑能源系统案例中的应用,将该策略与传统的基于规则的控制策略相比较,结果表明,所提出的优化策略使用电成本降低11.03%,热不舒适时长降低89.62%,未消纳光伏发电量降低10.43%。

     

    Abstract: Aiming at the load uncertainty on the demand side of the building energy system and the randomness of renewable energy on the supply side, a building energy system management optimization strategy is proposed based on deep reinforcement learning. Firstly, a supply-demand side research framework for the energy system and device model is built. The building energy management problem under the real-time stage is constructed as Markov decision-making process, and the deep reinforcement learning theory is used to minimize the cost of electricity, ensure the indoor heat comfort level and maximize the consumption of renewable energy as the optimization goals, and the duel dual deep Q network algorithm is used for model training, and the trained model can make adaptive control decisions according to real-time environmental parameters. Finally, through the application in the building energy system case, the results show that the proposed optimization strategy reduces the cost of electricity by 11.03%, the duration of thermal discomfort by 89.62%, and the amount of unconsumed photovoltaic power generation by 10.43%, comparing with the traditional rule-based control strategy.

     

/

返回文章
返回