Monitoring data-driven Reinforcement Learning controller training: A comparative study of different training strategies for a real-world energy system

Thomas Schreiber, Christoph Netsch, Marc Baranski, Dirk Müller

doi:10.1016/j.enbuild.2021.110856

Abstract

As building energy systems grow more complex and the share of renewable energy in the grid rises, the requirements for building automation and control systems (BACS) are increasing. Storage systems make it possible to decouple energy demand from supply and to account for dynamic constraints in system control. The resulting optimization problem is very challenging to solve with the state-of-the-art rule-based control (RBC) approach. Model Predictive Control (MPC), on the other hand, allows nearly optimal operation but comes with expensive modeling effort and high computational cost. These drawbacks contrast with promising results from the field of Reinforcement Learning (RL). RL can be model-free, is highly adaptive, and learns a policy by interacting with the controlled system. However, the literature also raises a number of questions that must be answered before RL for BACS can be realized. One is the slow convergence of the training process, which makes a pre-training strategy necessary. Therefore, we design and compare different pre-training workflows for a real-world energy system in a demand response scenario. We apply a data-driven approach, covering all aspects from raw monitoring data to the trained algorithm. The considered energy system consists of two compression chillers and an ice storage unit. The objective of the control task is to charge and discharge the storage with respect to dynamic constraints. We use machine learning models of the energy system to train and evaluate a state-of-the-art RL algorithm (DQN) under five different pre-training strategies. We compare online and offline training and initialization of the RL controller together with a guiding RBC. We demonstrate that offline training with a guiding RBC provides stable learning and an RL controller that always outperforms this guiding RBC. Unguided exploration, on the other hand, leads to higher accumulated cost savings. Based on our findings, we derive recommendations for practical application and future research questions.

Journal: Energy and Buildings
Publication Date: Mar 2, 2021
Citations: 11
