Abstract

In recent years, home energy management systems (HEMS), which enable the automatic control of electrical equipment and home appliances, have been attracting attention as a method for saving electricity at home. A HEMS achieves energy saving by visualizing household energy consumption and controlling energy-consuming equipment such as air conditioners. The optimal control law is difficult to obtain owing to uncertainties in power demand and in the power supplied by the electrical equipment. Deep reinforcement learning has been used to address energy optimization problems in home environments. However, a HEMS simultaneously controls several components, such as heating, ventilation, and air conditioning (HVAC) systems, storage batteries, and electric water heaters, so the action space becomes extremely large. Consequently, traditional deep reinforcement learning methods may fail to fully learn from rare experiences because of the large state-action space and the slow propagation of delayed rewards. In this study, we propose an energy management algorithm that uses the Dual Targeting Algorithm (DTA) to strongly reinforce experiences that yield high returns, exploiting the quick propagation of delayed rewards via multistep returns. The proposed algorithm is applied to a HEMS learning experiment that controls a storage battery and an HVAC system, and its performance is compared with that of a Deep Deterministic Policy Gradient (DDPG)-based energy management system. The results confirm that the proposed method reduces the number of hours deviating from the comfort temperature range by about 17% compared with the existing method.

Highlights

  • In recent years, there have been many attempts to save energy by visualizing the amount of energy consumed at home and controlling energy-consuming equipment such as air conditioners

  • In the learning curve of the exploitation-oriented DDPG (ExDDPG)-based energy management algorithm, the learning performance sometimes drops significantly and then recovers, as shown in Fig. 3

  • Since the use of eligibility traces [23] and of arbitrary n-step returns [24] have been proposed as deep reinforcement learning methods using multistep returns, a comparison between the ExDDPG-based energy management algorithm and energy management algorithms using these methods is left for future work


Summary

INTRODUCTION

There have been many attempts to save energy by visualizing the amount of energy consumed at home and by controlling energy-consuming equipment such as air conditioners. We propose an exploitation-oriented DDPG (ExDDPG)-based energy management algorithm that strongly reinforces experiences of high daily returns by introducing the Dual Targeting Algorithm (DTA) into the DDPG-based energy management algorithm. In this algorithm, when learning each state-action pair, the multistep return accumulated until the end of the day (i.e., 24:00) is used whenever it is higher than the 1-step return. We apply the ExDDPG-based energy management algorithm to a HEMS that controls the ESS and HVAC systems, and verify its effectiveness in maintaining a comfortable room temperature and reducing electricity charges compared with the DDPG-based energy management algorithm. We detail the models used for the storage battery and the HVAC system and formulate the sequential decision-making problems as Markov decision processes (MDPs).
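To make the target-selection rule concrete, the sketch below shows one way such a dual target could be computed for a single transition. It is a minimal illustration in Python: the function name, argument layout, and discounting details are our assumptions, not the paper's implementation.

    def exddpg_critic_target(rewards, t, q_next, gamma=0.99):
        """Illustrative Dual Targeting Algorithm (DTA) target for step t.

        rewards : rewards r_t, ..., r_{T-1} observed until the end of the
                  day (T corresponds to 24:00).
        q_next  : target-network estimate Q'(s_{t+1}, mu'(s_{t+1})).
        Names and the discounting scheme are assumptions for illustration.
        """
        # Standard 1-step DDPG target: r_t + gamma * Q'(s_{t+1}, mu'(s_{t+1})).
        one_step = rewards[t] + gamma * q_next

        # Multistep return accumulated from step t until the end of the day.
        multistep = sum(gamma ** (k - t) * rewards[k]
                        for k in range(t, len(rewards)))

        # Dual targeting: use the multistep return only when it exceeds the
        # 1-step target, so high-return daily experiences are reinforced.
        return max(one_step, multistep)

In a full agent, this value would stand in for the usual critic target y_t = r_t + γQ'(s_{t+1}, μ'(s_{t+1})) when fitting the Q-network, leaving the rest of the DDPG update unchanged.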

SYSTEM MODEL
(The dynamics models of the storage battery and the HVAC system are given here; an illustrative sketch of a typical battery model appears after this outline.)
MDP FORMULATION
SIMULATION SETTING
EVALUATION METHOD
Method
Findings
CONCLUSION
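For concreteness, storage-battery dynamics in HEMS formulations of this kind are commonly written as a discrete-time state-of-charge update. The following is a minimal sketch in generic notation, assuming a simple charge/discharge efficiency model; it is an illustration, not necessarily the exact model used in the paper.

    % Illustrative (assumed) discrete-time state-of-charge dynamics:
    %   P^{ch}_t, P^{dis}_t   : charging / discharging power at step t
    %   \eta_{ch}, \eta_{dis} : charge / discharge efficiencies
    %   E : usable battery capacity,  \Delta t : time-step length
    \[
      \mathrm{SoC}_{t+1} = \mathrm{SoC}_t
        + \frac{\Delta t}{E}\left(
            \eta_{\mathrm{ch}}\, P^{\mathrm{ch}}_t
            - \frac{P^{\mathrm{dis}}_t}{\eta_{\mathrm{dis}}}
          \right),
      \qquad 0 \le \mathrm{SoC}_t \le 1 .
    \]

In an MDP formulation, bounds on the charging and discharging power and on the state of charge would then constrain the battery action.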