侯慧, 何梓姻, 陈跃, 侯婷婷, 唐金锐, 吴细秀. 基于深度强化学习区间多目标优化的智能建筑低碳优化调度[J]. 电力系统自动化, 2023, 47(21): 47-57.
引用本文: 侯慧, 何梓姻, 陈跃, 侯婷婷, 唐金锐, 吴细秀. 基于深度强化学习区间多目标优化的智能建筑低碳优化调度[J]. 电力系统自动化, 2023, 47(21): 47-57.
HOU Hui, HE Ziyin, CHEN Yue, HOU Tingting, TANG Jinrui, WU Xixiu. Low-carbon Optimal Dispatch of Smart Building Based on Interval Multi-objective Optimization with Deep Reinforcement Learning[J]. Automation of Electric Power Systems, 2023, 47(21): 47-57.
Citation: HOU Hui, HE Ziyin, CHEN Yue, HOU Tingting, TANG Jinrui, WU Xixiu. Low-carbon Optimal Dispatch of Smart Building Based on Interval Multi-objective Optimization with Deep Reinforcement Learning[J]. Automation of Electric Power Systems, 2023, 47(21): 47-57.

基于深度强化学习区间多目标优化的智能建筑低碳优化调度

Low-carbon Optimal Dispatch of Smart Building Based on Interval Multi-objective Optimization with Deep Reinforcement Learning

  • 摘要: 为充分挖掘并有效利用建筑的节能减排潜力,提出基于深度强化学习区间多目标优化的智能建筑低碳优化调度方法。首先,采取区间数等方法对系统中机组参数、建筑温度及源荷多重不确定性等进行建模。其次,计及系统碳排放与碳交易机制,以综合运行成本最低及用户舒适度最优为目标,优化系统运行。为解决区间多目标优化问题,提出一种将深度强化学习的深度Q网络与区间多目标粒子群优化算法耦合,进行“离线训练”与“在线指导”的新型多目标优化算法,高效求解多重不确定性因素下的智能建筑低碳优化调度问题。算例结果表明,所提深度强化学习区间多目标优化算法及调度模型能兼顾系统低碳性、经济性与用户舒适性等,同时有效提高系统应对多重不确定性因素的能力。

     

    Abstract: In order to fully tap and effectively utilize the energy saving and emission reduction potential of buildings, a low-carbon optimal dispatch method of smart buildings based on interval multi-objective optimization with deep reinforcement learning is proposed. Firstly, the unit parameters, building temperature and multiple uncertainties of source and load in the system are modeled by using interval number and other methods. Secondly, taking into account the system carbon emission and carbon trading mechanism, the system operation is optimized with the goal of the lowest comprehensive operation cost and the best user comfort.In order to solve the interval multi-objective optimization problem, a new multi-objective optimization algorithm is proposed,which combines deep Q network of deep reinforcement learning and interval multi-objective particle swarm optimization algorithm,and performs “offline training” and “online guidance” to efficiently solve the low-carbon optimization dispatch problem of smart buildings under multiple uncertainties. The case results show that the proposed interval multi-objective optimization algorithm with deep reinforcement learning and dispatch model can take into account the low carbon, economy and user comfort of the system,and effectively improve the ability of the system to deal with multiple uncertainties.

     

/

返回文章
返回