Interpretable policies for reinforcement learning by genetic programming

Daniel Hein,Steffen Udluft,Thomas A Runkler

doi:10.1016/j.engappai.2018.09.007

Abstract

The search for interpretable reinforcement learning policies is of high academic and industrial interest. Especially for industrial systems, domain experts are more likely to deploy autonomously learned controllers if they are understandable and convenient to evaluate. Basic algebraic equations are supposed to meet these requirements, as long as they are restricted to an adequate complexity. Here we introduce the genetic programming for reinforcement learning (GPRL) approach based on model-based batch reinforcement learning and genetic programming, which autonomously learns policy equations from pre-existing default state–action trajectory samples. GPRL is compared to a straightforward method which utilizes genetic programming for symbolic regression, yielding policies imitating an existing well-performing, but non-interpretable policy. Experiments on three reinforcement learning benchmarks, i.e., mountain car, cart–pole balancing, and industrial benchmark, demonstrate the superiority of our GPRL approach compared to the symbolic regression method. GPRL is capable of producing well-performing interpretable reinforcement learning policies from pre-existing default trajectory data.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Interpretable policies for reinforcement learning by genetic programming

Abstract

Talk to us

Similar Papers

More From: Engineering Applications of Artificial Intelligence

Lead the way for us

Journal: Engineering Applications of Artificial Intelligence	Publication Date: Sep 22, 2018
Citations: 89

Similar Papers

Genetic Programming for Symbolic Regression: A Study on Fish Weight Prediction
Yunhan Yang ... Mengjie Zhang
-
Yunhan Yang, et. al.Yunhan Yang ... Mengjie Zhang
28 Jun 2021
28 Jun 2021

Generating interpretable reinforcement learning policies using genetic programming
Daniel Hein ... Steffen Udluft
-
Daniel Hein, et. al.Daniel Hein ... Steffen Udluft
13 Jul 2019
13 Jul 2019

Local Optimization Often is Ill-conditioned in Genetic Programming for Symbolic Regression
Gabriel Kronberger
-
Gabriel KronbergerGabriel Kronberger
01 Sep 2022
01 Sep 2022

Multi-objective genetic programming for symbolic regression with the adaptive weighted splines representation
Christian Raymond ... Mengjie Zhang
-
Christian Raymond, et. al.Christian Raymond ... Mengjie Zhang
07 Jul 2021
07 Jul 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Interpretable policies for reinforcement learning by genetic programming

Abstract

Talk to us

Similar Papers

More From: Engineering Applications of Artificial Intelligence