Abstract

Reinforcement learning algorithms can solve dynamic decision-making and optimal control problems. With continuous-valued state and input variables, reinforcement learning algorithms must rely on function approximators to represent the value function and policy mappings. Commonly used numerical approximators, such as neural networks or basis function expansions, have two main drawbacks: they are black-box models offering little insight into the mappings learned, and they require extensive trial and error tuning of their hyper-parameters. In this paper, we propose a new approach to constructing smooth value functions in the form of analytic expressions by using symbolic regression. We introduce three off-line methods for finding value functions based on a state-transition model: symbolic value iteration, symbolic policy iteration, and a direct solution of the Bellman equation. The methods are illustrated on four nonlinear control problems: velocity control under friction, one-link and two-link pendulum swing-up, and magnetic manipulation. The results show that the value functions yield well-performing policies and are compact, mathematically tractable, and easy to plug into other algorithms. This makes them potentially suitable for further analysis of the closed-loop system. A comparison with an alternative approach using neural networks shows that our method outperforms the neural network-based one.
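The three off-line methods all revolve around the Bellman optimality equation. In a common model-based formulation (shown here for reference; the paper's exact notation may differ), with a deterministic state-transition model $x' = f(x, u)$, reward function $\rho(x, u)$, and discount factor $\gamma \in (0, 1)$, the optimal value function satisfies

$$V^{*}(x) = \max_{u} \left[ \rho(x, u) + \gamma\, V^{*}\bigl(f(x, u)\bigr) \right].$$

Symbolic value iteration repeatedly applies the corresponding backup

$$V_{k+1}(x) \approx \max_{u} \left[ \rho(x, u) + \gamma\, V_{k}\bigl(f(x, u)\bigr) \right],$$

fitting each $V_{k+1}$ with symbolic regression so that the result is a compact analytic expression rather than a black-box numerical approximation.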

Highlights

  • Reinforcement learning (RL) in continuous-valued state and input spaces relies on function approximators

  • While the chosen test problems have low-dimensional input and state spaces, they are representative of challenging control problems as none of them can be solved by linear control methods

  • We have proposed three methods based on symbolic regression to construct an analytic approximation of the V-function in a Markov decision process


Summary

Introduction

Reinforcement learning (RL) in continuous-valued state and input spaces relies on function approximators. There are no guidelines on how to design a good value function approximator and, as a consequence, a large amount of expert knowledge and haphazard tuning is required when applying RL techniques to continuous-valued problems. The interpolation properties of numerical function approximators may adversely affect the control performance and result in chattering control signals and steady-state errors [12]. This makes RL inferior to alternative control design methods, despite the theoretical potential of RL to produce optimal control policies [13].
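To make the proposed remedy concrete, the following is a minimal sketch of symbolic value iteration on a toy one-dimensional problem. Everything here is an illustrative assumption rather than the authors' implementation: the transition model f, the reward rho, the sampling grids, and the use of gplearn's SymbolicRegressor as a stand-in for the symbolic-regression method used in the paper.

```python
# Minimal sketch of symbolic value iteration (illustrative assumptions only).
import numpy as np
from gplearn.genetic import SymbolicRegressor  # stand-in symbolic regressor

gamma = 0.95                       # discount factor
X = np.linspace(-2.0, 2.0, 200)    # sampled 1-D states
U = np.linspace(-1.0, 1.0, 21)     # discretised candidate control inputs

def f(x, u):
    """Hypothetical deterministic state-transition model."""
    return 0.9 * x + 0.1 * u

def rho(x, u):
    """Hypothetical reward: penalise distance from origin and control effort."""
    return -(x ** 2) - 0.1 * (u ** 2)

V = lambda x: np.zeros_like(x)     # initial value function V_0(x) = 0

for k in range(10):
    # Bellman backup on the sampled states: maximise over candidate inputs.
    targets = np.max([rho(X, u) + gamma * V(f(X, u)) for u in U], axis=0)

    # Fit an analytic expression V_{k+1}(x) to the backed-up targets.
    sr = SymbolicRegressor(population_size=500, generations=20,
                           function_set=('add', 'sub', 'mul'),
                           random_state=0, verbose=0)
    sr.fit(X.reshape(-1, 1), targets)
    V = lambda x, m=sr: m.predict(np.atleast_1d(x).reshape(-1, 1))

print(sr._program)  # the learned value function as a symbolic expression
```

The policy-iteration and direct Bellman-equation variants mentioned in the abstract would change what the regressor is asked to fit; this sketch only illustrates the value-iteration flavour.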

