Abstract
Plain reinforcement learning (RL) may suffer from failure to converge, constraint violations, unexpected performance degradation, and so on. RL agents commonly require extensive training to achieve proper functionality, in contrast to classical control algorithms, which are typically model-based. One direction of research is the fusion of RL with such algorithms, especially model-predictive control (MPC). This fusion, however, introduces new hyper-parameters related to the prediction horizon. Furthermore, RL is usually formulated over Markov decision processes, yet most real environments are not time-discrete: the actual physical setting of RL consists of a digital agent interacting with a time-continuous dynamical system. There is thus yet another hyper-parameter, the agent sampling time. In this paper, we investigate the effects of the prediction horizon and the sampling time on two hybrid RL-MPC agents in a case study of mobile robot parking, a canonical control problem. We benchmark the agents against a simple variant of MPC. The sampling time showed a “sweet spot” behavior, whereas the RL agents demonstrated merits at shorter horizons.
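To make the two hyper-parameters concrete, here is a minimal Python sketch of a receding-horizon control loop acting on a continuous-time plant under zero-order hold. The unicycle kinematics, quadratic cost, horizon N, and sampling time DT are illustrative assumptions, not the paper's actual agents or benchmark.

```python
import numpy as np
from scipy.optimize import minimize

DT = 0.1  # agent sampling time [s] (hyper-parameter studied in the paper)
N = 5     # prediction horizon, in steps (the other hyper-parameter)

def step(state, action, dt=DT):
    """One zero-order-hold step of simple unicycle kinematics (assumed plant)."""
    x, y, theta = state
    v, omega = action
    return np.array([x + dt * v * np.cos(theta),
                     y + dt * v * np.sin(theta),
                     theta + dt * omega])

def stage_cost(state, action):
    """Quadratic distance to a parking target at the origin (assumed cost)."""
    return np.dot(state, state) + 0.1 * np.dot(action, action)

def mpc_action(state):
    """Minimize the N-step predicted cost and return only the first action."""
    def horizon_cost(flat_actions):
        actions = flat_actions.reshape(N, 2)
        s, total = state, 0.0
        for a in actions:
            total += stage_cost(s, a)
            s = step(s, a)
        return total
    res = minimize(horizon_cost, np.zeros(2 * N), method="SLSQP")
    return res.x[:2]  # receding horizon: apply the first action only

state = np.array([1.0, 1.0, 0.0])
for _ in range(50):  # closed loop, one optimization per sampling period DT
    state = step(state, mpc_action(state))
```

Only the first optimized action is applied at each step, after which the optimization is repeated from the new state; this receding-horizon structure is what makes the prediction horizon and the sampling time two independent tuning knobs.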
Highlights
Reinforcement Learning (RL) shows remarkable performance in playground settings of video and board games such as StarCraft, chess, and Go [1]–[3]
Industry-close applications appear more challenging for RL due to the lack of freedom in training [4]–[8]
Industry is dominated by classical control-theoretic methods such as model-predictive control (MPC) [11]–[13]
Summary
Reinforcement Learning (RL) shows remarkable performance in playground settings of video and board games such as StarCraft, chess, and Go [1]–[3]. Industry-close applications appear more challenging for RL due to the lack of freedom in training [4]–[8], which may be related to limited resources and technical constraints. Industry is dominated by classical control-theoretic methods such as model-predictive control (MPC) [11]–[13]. Somewhat in contrast to classical control, RL aims at a learning-based and, in some configurations, model-free approach. It is perhaps the model-based formal guarantees that make classical control attractive to industry. This work goes along the lines of fusing RL with predictive control and addresses the tuning of the latter.