Performance Assessment and Comparative Analysis of Photovoltaic-Battery System Scheduling in an Existing Zero-Energy House Based on Reinforcement Learning Control

The development of distributed renewable energy resources and smart energy management are efficient approaches to decarbonizing building energy systems. Reinforcement learning (RL) is a data-driven control algorithm that trains a large amount of data to learn control policy. However, this learning p...

Full description

Bibliographic Details
Main Authors:	Wenya Xu, Yanxue Li, Guanjie He, Yang Xu, Weijun Gao
Format:	Article
Language:	English
Published:	MDPI AG 2023-06-01
Series:	Energies
Subjects:	reinforcement learning reward design battery storage PV consumption energy cost
Online Access:	https://www.mdpi.com/1996-1073/16/13/4844

_version_	1827734990892302336
author	Wenya Xu Yanxue Li Guanjie He Yang Xu Weijun Gao
author_facet	Wenya Xu Yanxue Li Guanjie He Yang Xu Weijun Gao
author_sort	Wenya Xu
collection	DOAJ
description	The development of distributed renewable energy resources and smart energy management are efficient approaches to decarbonizing building energy systems. Reinforcement learning (RL) is a data-driven control algorithm that trains a large amount of data to learn control policy. However, this learning process generally presents low learning efficiency using real-world stochastic data. To address this challenge, this study proposes a model-based RL approach to optimize the operation of existing zero-energy houses considering PV generation consumption and energy costs. The model-based approach takes advantage of the inner understanding of the system dynamics; this knowledge improves the learning efficiency. A reward function is designed considering the physical constraints of battery storage, photovoltaic (PV) production feed-in profit, and energy cost. Measured data of a zero-energy house are used to train and test the proposed RL agent control, including <i>Q</i>-learning, deep <i>Q</i> network (DQN), and deep deterministic policy gradient (DDPG) agents. The results show that the proposed RL agents can achieve fast convergence during the training process. In comparison with the rule-based strategy, test cases verify the cost-effectiveness performances of proposed RL approaches in scheduling operations of the hybrid energy system under different scenarios. The comparative analysis of test periods shows that the DQN agent presents better energy cost-saving performances than <i>Q</i>-learning while the <i>Q</i>-learning agent presents more flexible action control of the battery with the fluctuation of real-time electricity prices. The DDPG algorithm can achieve the highest PV self-consumption ratio, 49.4%, and the self-sufficiency ratio reaches 36.7%. The DDPG algorithm outperforms rule-based operation by 7.2% for energy cost during test periods.
first_indexed	2024-03-11T01:42:58Z
format	Article
id	doaj.art-67fe6f40b4824213842c426a08f14d71
institution	Directory Open Access Journal
issn	1996-1073
language	English
last_indexed	2024-03-11T01:42:58Z
publishDate	2023-06-01
publisher	MDPI AG
record_format	Article
series	Energies
spelling	doaj.art-67fe6f40b4824213842c426a08f14d712023-11-18T16:26:41ZengMDPI AGEnergies1996-10732023-06-011613484410.3390/en16134844Performance Assessment and Comparative Analysis of Photovoltaic-Battery System Scheduling in an Existing Zero-Energy House Based on Reinforcement Learning ControlWenya Xu0Yanxue Li1Guanjie He2Yang Xu3Weijun Gao4Innovation Institute for Sustainable Maritime Architecture Research and Technology, Qingdao University of Technology, Qingdao 266033, ChinaInnovation Institute for Sustainable Maritime Architecture Research and Technology, Qingdao University of Technology, Qingdao 266033, ChinaInnovation Institute for Sustainable Maritime Architecture Research and Technology, Qingdao University of Technology, Qingdao 266033, ChinaInnovation Institute for Sustainable Maritime Architecture Research and Technology, Qingdao University of Technology, Qingdao 266033, ChinaInnovation Institute for Sustainable Maritime Architecture Research and Technology, Qingdao University of Technology, Qingdao 266033, ChinaThe development of distributed renewable energy resources and smart energy management are efficient approaches to decarbonizing building energy systems. Reinforcement learning (RL) is a data-driven control algorithm that trains a large amount of data to learn control policy. However, this learning process generally presents low learning efficiency using real-world stochastic data. To address this challenge, this study proposes a model-based RL approach to optimize the operation of existing zero-energy houses considering PV generation consumption and energy costs. The model-based approach takes advantage of the inner understanding of the system dynamics; this knowledge improves the learning efficiency. A reward function is designed considering the physical constraints of battery storage, photovoltaic (PV) production feed-in profit, and energy cost. Measured data of a zero-energy house are used to train and test the proposed RL agent control, including <i>Q</i>-learning, deep <i>Q</i> network (DQN), and deep deterministic policy gradient (DDPG) agents. The results show that the proposed RL agents can achieve fast convergence during the training process. In comparison with the rule-based strategy, test cases verify the cost-effectiveness performances of proposed RL approaches in scheduling operations of the hybrid energy system under different scenarios. The comparative analysis of test periods shows that the DQN agent presents better energy cost-saving performances than <i>Q</i>-learning while the <i>Q</i>-learning agent presents more flexible action control of the battery with the fluctuation of real-time electricity prices. The DDPG algorithm can achieve the highest PV self-consumption ratio, 49.4%, and the self-sufficiency ratio reaches 36.7%. The DDPG algorithm outperforms rule-based operation by 7.2% for energy cost during test periods.https://www.mdpi.com/1996-1073/16/13/4844reinforcement learningreward designbattery storagePV consumptionenergy cost
spellingShingle	Wenya Xu Yanxue Li Guanjie He Yang Xu Weijun Gao Performance Assessment and Comparative Analysis of Photovoltaic-Battery System Scheduling in an Existing Zero-Energy House Based on Reinforcement Learning Control Energies reinforcement learning reward design battery storage PV consumption energy cost
title	Performance Assessment and Comparative Analysis of Photovoltaic-Battery System Scheduling in an Existing Zero-Energy House Based on Reinforcement Learning Control
title_full	Performance Assessment and Comparative Analysis of Photovoltaic-Battery System Scheduling in an Existing Zero-Energy House Based on Reinforcement Learning Control
title_fullStr	Performance Assessment and Comparative Analysis of Photovoltaic-Battery System Scheduling in an Existing Zero-Energy House Based on Reinforcement Learning Control
title_full_unstemmed	Performance Assessment and Comparative Analysis of Photovoltaic-Battery System Scheduling in an Existing Zero-Energy House Based on Reinforcement Learning Control
title_short	Performance Assessment and Comparative Analysis of Photovoltaic-Battery System Scheduling in an Existing Zero-Energy House Based on Reinforcement Learning Control
title_sort	performance assessment and comparative analysis of photovoltaic battery system scheduling in an existing zero energy house based on reinforcement learning control
topic	reinforcement learning reward design battery storage PV consumption energy cost
url	https://www.mdpi.com/1996-1073/16/13/4844
work_keys_str_mv	AT wenyaxu performanceassessmentandcomparativeanalysisofphotovoltaicbatterysystemschedulinginanexistingzeroenergyhousebasedonreinforcementlearningcontrol AT yanxueli performanceassessmentandcomparativeanalysisofphotovoltaicbatterysystemschedulinginanexistingzeroenergyhousebasedonreinforcementlearningcontrol AT guanjiehe performanceassessmentandcomparativeanalysisofphotovoltaicbatterysystemschedulinginanexistingzeroenergyhousebasedonreinforcementlearningcontrol AT yangxu performanceassessmentandcomparativeanalysisofphotovoltaicbatterysystemschedulinginanexistingzeroenergyhousebasedonreinforcementlearningcontrol AT weijungao performanceassessmentandcomparativeanalysisofphotovoltaicbatterysystemschedulinginanexistingzeroenergyhousebasedonreinforcementlearningcontrol

Performance Assessment and Comparative Analysis of Photovoltaic-Battery System Scheduling in an Existing Zero-Energy House Based on Reinforcement Learning Control

Similar Items