Continuous state/action reinforcement learning: A growing self-organizing map approach

Hesam Montazeri, Sajjad Moradi, Reza Safabakhsh

doi:10.1016/j.neucom.2010.11.012

Abstract

This paper proposes an algorithm for reinforcement learning (RL) problems with continuous state/action spaces. Continuous-state RL problems have been studied extensively, but RL problems with continuous action spaces need more research. Because RL problems are non-stationary, very large, and continuous, the proposed algorithm uses two growing self-organizing maps (GSOMs) to approximate the state/action space by adding and deleting neurons. The GSOM has been shown to outperform the standard SOM in topology preservation, quantization error reduction, and approximation of non-stationary distributions. The proposed algorithm attempts to simultaneously find the best representation of the state space, an accurate estimate of the Q-values, and an appropriate representation of the highly rewarded regions of the action space. Experimental results on delayed-reward, non-stationary, and large-scale problems demonstrate very satisfactory performance of the proposed algorithm.
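The abstract does not give the algorithm's update rules, so the following is only a minimal illustrative sketch of the general idea: one growing map quantizes continuous states, another holds candidate actions, and Q-values are stored per (state unit, action unit) pair. Everything here is an assumption made for illustration, not the authors' method: the class names `GrowingMap` and `MapQLearner`, the distance-based growth rule, and the parameters `growth_threshold`, `learning_rate`, `alpha`, and `gamma` are all hypothetical. Neuron deletion and neighborhood (topology) updates, which a real GSOM would include, are omitted.

```python
import math


class GrowingMap:
    """Toy growing quantizer standing in for a GSOM (not the paper's algorithm)."""

    def __init__(self, first_unit, growth_threshold, learning_rate=0.1):
        self.units = [list(first_unit)]           # weight vectors of the units
        self.growth_threshold = growth_threshold  # hypothetical growth criterion
        self.learning_rate = learning_rate        # hypothetical adaptation rate

    def nearest(self, x):
        """Return (index, distance) of the unit closest to x."""
        dists = [math.dist(u, x) for u in self.units]
        i = min(range(len(dists)), key=dists.__getitem__)
        return i, dists[i]

    def observe(self, x):
        """Add a unit if x is poorly covered; otherwise move the winner toward x."""
        i, d = self.nearest(x)
        if d > self.growth_threshold:
            self.units.append(list(x))
            return len(self.units) - 1
        w = self.units[i]
        for k in range(len(w)):
            w[k] += self.learning_rate * (x[k] - w[k])
        return i


class MapQLearner:
    """Tabular Q-learning over (state unit, action unit) index pairs."""

    def __init__(self, state_map, action_map, alpha=0.5, gamma=0.9):
        self.state_map = state_map
        self.action_map = action_map
        self.alpha = alpha  # Q-learning step size (assumed value)
        self.gamma = gamma  # discount factor (assumed value)
        self.q = {}         # missing entries are treated as 0.0

    def best_action(self, s):
        """Return (action unit index, Q-value) of the greedy action in state unit s."""
        values = [self.q.get((s, a), 0.0) for a in range(len(self.action_map.units))]
        a = max(range(len(values)), key=values.__getitem__)
        return a, values[a]

    def update(self, state, action_idx, reward, next_state):
        """One Q-learning backup; both states are quantized by the state map."""
        s = self.state_map.observe(state)
        s_next = self.state_map.observe(next_state)
        _, v_next = self.best_action(s_next)
        old = self.q.get((s, action_idx), 0.0)
        self.q[(s, action_idx)] = old + self.alpha * (reward + self.gamma * v_next - old)
        return s
```

In this sketch the state map grows whenever an observed state lies farther than `growth_threshold` from every existing unit, so regions the agent actually visits receive finer coverage. The action map could be grown in the same way around actions that earn high reward, which is one plausible reading of the abstract's "highly rewarded regions in the action space", though the paper's actual rule is not stated here.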

Journal: Neurocomputing
Publication Date: Dec 28, 2010
Citations: 21

Similar Papers

Efficient reinforcement learning in continuous state and action spaces with Dyna and policy approximation
Shan Zhong ... Quan Liu
Frontiers of Computer Science | Vol. 13
13 Feb 2018

Experiments with Reinforcement Learning in Problems with Continuous State and Action Spaces
Juan C Santamaria ... Ashwin Ram
Adaptive Behavior | Vol. 6
01 Sep 1997

Deep Near Unsupervised Learning for Data Analysis in Metabolomics, Drug-Drug Interaction Discovery and Human Gait Recognition
Saman K Halgamuge
01 Jan 2015

A neural field approach to topological reinforcement learning in continuous action spaces
H.-M Gross ... V Stephan
04 May 1998
