From free energy to expected energy: Improving energy-based value function approximation in reinforcement learning

Stefan Elfwing,Eiji Uchibe,Kenji Doya

doi:10.1016/j.neunet.2016.07.013

Abstract

Free-energy based reinforcement learning (FERL) was proposed for learning in high-dimensional state and action spaces. However, the FERL method does only really work well with binary, or close to binary, state input, where the number of active states is fewer than the number of non-active states. In the FERL method, the value function is approximated by the negative free energy of a restricted Boltzmann machine (RBM). In our earlier study, we demonstrated that the performance and the robustness of the FERL method can be improved by scaling the free energy by a constant that is related to the size of network. In this study, we propose that RBM function approximation can be further improved by approximating the value function by the negative expected energy (EERL), instead of the negative free energy, as well as being able to handle continuous state input. We validate our proposed method by demonstrating that EERL: (1) outperforms FERL, as well as standard neural network and linear function approximation, for three versions of a gridworld task with high-dimensional image state input; (2) achieves new state-of-the-art results in stochastic SZ-Tetris in both model-free and model-based learning settings; and (3) significantly outperforms FERL and standard neural network function approximation for a robot navigation task with raw and noisy RGB images as state input and a large number of actions.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Neural Networks	Publication Date: Aug 26, 2016
Citations: 17	License type: cc-by-nc-nd

R Discovery Prime

R Discovery Prime

From free energy to expected energy: Improving energy-based value function approximation in reinforcement learning

Abstract

Talk to us

Similar Papers

More From: Neural Networks

Lead the way for us

Similar Papers

Expected energy-based restricted Boltzmann machine for classification
S Elfwing ... K Doya
Neural Networks | VOL. 64
S Elfwing, et. al.S Elfwing ... K Doya
28 Sep 2014
Neural Networks | VOL. 64

Scaled free-energy based reinforcement learning for robust and efficient learning in high-dimensional state spaces
Stefan Elfwing ... Eiji Uchibe
Frontiers in Neurorobotics | VOL. 7
Stefan Elfwing, et. al.Stefan Elfwing ... Eiji Uchibe
01 Jan 2013
Frontiers in Neurorobotics | VOL. 7

Guaranteed Globally Optimal continuous Reinforcement Learning
Hildo Bijl ... Jan Albert Mulder
-
Hildo Bijl, et. al.Hildo Bijl ... Jan Albert Mulder
10 Jan 2014
10 Jan 2014

A Survey of Linear Value Function Approximation in Reinforcement Learning
Shicheng Guo ... Bo Wei
-
Shicheng Guo, et. al.Shicheng Guo ... Bo Wei
01 Jan 2021
01 Jan 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

From free energy to expected energy: Improving energy-based value function approximation in reinforcement learning

Abstract

Talk to us

Similar Papers

More From: Neural Networks