Performance Assessment and Comparative Analysis of Photovoltaic-Battery System Scheduling in an Existing Zero-Energy House Based on Reinforcement Learning Control

Wenya Xu,Guanjie He,Yang Xu,Weijun Gao,Yanxue Li

doi:10.3390/en16134844

Wenya Xu, Guanjie He + Show 3 more

Open Access

https://doi.org/10.3390/en16134844

Copy DOI

Export

Save

Cite

Abstract
Full-Text
Similar Papers

Abstract

Listen

The development of distributed renewable energy resources and smart energy management are efficient approaches to decarbonizing building energy systems. Reinforcement learning (RL) is a data-driven control algorithm that trains a large amount of data to learn control policy. However, this learning process generally presents low learning efficiency using real-world stochastic data. To address this challenge, this study proposes a model-based RL approach to optimize the operation of existing zero-energy houses considering PV generation consumption and energy costs. The model-based approach takes advantage of the inner understanding of the system dynamics; this knowledge improves the learning efficiency. A reward function is designed considering the physical constraints of battery storage, photovoltaic (PV) production feed-in profit, and energy cost. Measured data of a zero-energy house are used to train and test the proposed RL agent control, including Q-learning, deep Q network (DQN), and deep deterministic policy gradient (DDPG) agents. The results show that the proposed RL agents can achieve fast convergence during the training process. In comparison with the rule-based strategy, test cases verify the cost-effectiveness performances of proposed RL approaches in scheduling operations of the hybrid energy system under different scenarios. The comparative analysis of test periods shows that the DQN agent presents better energy cost-saving performances than Q-learning while the Q-learning agent presents more flexible action control of the battery with the fluctuation of real-time electricity prices. The DDPG algorithm can achieve the highest PV self-consumption ratio, 49.4%, and the self-sufficiency ratio reaches 36.7%. The DDPG algorithm outperforms rule-based operation by 7.2% for energy cost during test periods.

Full Text

Published Version

View

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Energies	Publication Date: Jun 21, 2023
Citations: 3	License type: CC BY 4.0

R Discovery Prime

Performance Assessment and Comparative Analysis of Photovoltaic-Battery System Scheduling in an Existing Zero-Energy House Based on Reinforcement Learning Control

Abstract

Published Version

Talk to us

Similar Papers

More From: Energies

Lead the way for us

Similar Papers

Reinforcement Learning-Based Energy Management Control Strategy of Hybrid Electric Vehicles
Fei Chen ... Bin Xu
-
Fei Chen, et. al.Fei Chen ... Bin Xu
08 Apr 2022
08 Apr 2022

Dynamic Resource Allocation of Reinforcement Learning Based on Neural Networks in Software Defined Networks
Xinjiu Xie
Procedia Computer Science | VOL. 243
Xinjiu XieXinjiu Xie
01 Jan 2024
Procedia Computer Science | VOL. 243

Trajectory tracking control of wheeled mobile robot based on improved LSTM-DDPG algorithm
Wenyao Gou ... Yan Liu
Journal of Physics: Conference Series | VOL. 2303
Wenyao Gou, et. al.Wenyao Gou ... Yan Liu
01 Jul 2022
Journal of Physics: Conference Series | VOL. 2303

A State-Compensated Deep Deterministic Policy Gradient Algorithm for UAV Trajectory Tracking
Jiying Wu ... Luwei Liao
Machines | VOL. 10
Jiying Wu, et. al.Jiying Wu ... Luwei Liao
21 Jun 2022
Machines | VOL. 10

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

Performance Assessment and Comparative Analysis of Photovoltaic-Battery System Scheduling in an Existing Zero-Energy House Based on Reinforcement Learning Control

Abstract

Published Version

Talk to us

Similar Papers

More From: Energies