Comparison of online and offline deep reinforcement learning with model predictive control for thermal energy management

Silvio Brandi,Massimo Fiorentini,Alfonso Capozzoli

doi:10.1016/j.autcon.2022.104128

Abstract

This paper proposes a comparison between an online and offline Deep Reinforcement Learning (DRL) formulation with a Model Predictive Control (MPC) architecture for energy management of a cold-water buffer tank linking an office building and a chiller subject to time-varying energy prices, with the objective of minimizing operating costs. The intrinsic model-free approach of DRL is generally lost in common implementations for energy management, as they are usually pre-trained offline and require a surrogate model for this purpose. Simulation results showed that the online-trained DRL agent, while requiring an initial 4 weeks adjustment period achieving a relatively poor performance (160% higher cost), it converged to a control policy almost as effective as the model-based strategies (3.6% higher cost in the last month). This suggests that the DRL agent trained online may represent a promising solution to overcome the barrier represented by the modelling requirements of MPC and offline-trained DRL approaches.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Automation in Construction	Publication Date: Jan 10, 2022
Citations: 43	License type: other-oa

R Discovery Prime

R Discovery Prime

Comparison of online and offline deep reinforcement learning with model predictive control for thermal energy management

Abstract

Talk to us

Similar Papers

More From: Automation in Construction

Lead the way for us

Similar Papers

Less is More
S Murugesan ... K H Drees
-
S Murugesan, et. al.S Murugesan ... K H Drees
17 Nov 2020
17 Nov 2020

Reactive Power Optimization of Distribution Network Based on Deep Reinforcement Learning and Multi Agent System
Zhi Gao ... Yang Yang
-
Zhi Gao, et. al.Zhi Gao ... Yang Yang
22 Oct 2021
22 Oct 2021

Control of superheat of organic Rankine cycle under transient heat source based on deep reinforcement learning
Xuan Wang ... Jiaying Pan
Applied Energy | VOL. 278
Xuan Wang, et. al.Xuan Wang ... Jiaying Pan
04 Aug 2020
Applied Energy | VOL. 278

Deep reinforcement learning for autonomous SideLink radio resource management in platoon-based C-V2X networks: An overview
Nessrine Trabelsi ... Wael Jaafar
Computer Networks | VOL. -
Nessrine Trabelsi, et. al.Nessrine Trabelsi ... Wael Jaafar
01 Nov 2024
Computer Networks | VOL. -

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Comparison of online and offline deep reinforcement learning with model predictive control for thermal energy management

Abstract

Talk to us

Similar Papers

More From: Automation in Construction