Reinforcement Learning with Particle Swarm Optimization Policy (PSO-P) in Continuous State and Action Spaces

Daniel Hein,Thomas A Runkler,Alexander Hentschel,Steffen Udluft

doi:10.4018/ijsir.2016070102

Abstract

This article introduces a model-based reinforcement learning (RL) approach for continuous state and action spaces. While most RL methods try to find closed-form policies, the approach taken here employs numerical on-line optimization of control action sequences. First, a general method for reformulating RL problems as optimization tasks is provided. Subsequently, Particle Swarm Optimization (PSO) is applied to search for optimal solutions. This Particle Swarm Optimization Policy (PSO-P) is effective for high dimensional state spaces and does not require a priori assumptions about adequate policy representations. Furthermore, by translating RL problems into optimization tasks, the rich collection of real-world inspired RL benchmarks is made available for benchmarking numerical optimization techniques. The effectiveness of PSO-P is demonstrated on the two standard benchmarks: mountain car and cart pole.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Reinforcement Learning with Particle Swarm Optimization Policy (PSO-P) in Continuous State and Action Spaces

Abstract

Talk to us

Similar Papers

More From: International Journal of Swarm Intelligence Research

Lead the way for us

Journal: International Journal of Swarm Intelligence Research	Publication Date: Jul 1, 2016
Citations: 24

Similar Papers

Particle Swarm Optimization for Model Predictive Control in Reinforcement Learning Environments
Daniel Hein ... Steffen Udluft
-
Daniel Hein, et. al.Daniel Hein ... Steffen Udluft
01 Jan 2018
01 Jan 2018

Reinforcement learning combined with human feedback in continuous state and action spaces
Ngo Anh Vien ... Wolfgang Ertel
-
Ngo Anh Vien, et. al. Ngo Anh Vien ... Wolfgang Ertel
01 Nov 2012
01 Nov 2012

Experiments with Reinforcement Learning in Problems with Continuous State and Action Spaces
Juan C Santamaria ... Ashwin Ram
Adaptive Behavior | VOL. 6
Juan C Santamaria, et. al.Juan C Santamaria ... Ashwin Ram
01 Sep 1997
Adaptive Behavior | VOL. 6

Learning via human feedback in continuous state and action spaces
Ngo Anh Vien ... Wolfgang Ertel
Applied Intelligence | VOL. 39
Ngo Anh Vien, et. al.Ngo Anh Vien ... Wolfgang Ertel
02 Feb 2013
Applied Intelligence | VOL. 39

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Reinforcement Learning with Particle Swarm Optimization Policy (PSO-P) in Continuous State and Action Spaces

Abstract

Talk to us

Similar Papers

More From: International Journal of Swarm Intelligence Research