A reinforcement learning approach to coordinate exploration with limited communication in continuous action games

Abdel Rodríguez, Peter Vrancx, Ricardo Grau, Ann Nowé

doi:10.1017/s026988891500020x

Abstract

Learning automata are reinforcement learners belonging to the class of policy iterators. They have already been shown to exhibit nice convergence properties in a wide range of discrete action game settings. Recently, a new formulation of the continuous action reinforcement learning automaton (CARLA) was proposed. In this paper, we study the behavior of CARLA in continuous action games and propose a novel method for coordinated exploration of the joint-action space. Our method allows a team of independent learners, each using CARLA, to find the optimal joint action in common interest settings. We first show that independent agents using CARLA will converge to a local optimum of the continuous action game. We then introduce a method for coordinated exploration which allows the team of agents to find the global optimum of the game. We validate our approach in a number of experiments.

Journal: The Knowledge Engineering Review
Publication Date: Jan 1, 2016
Citations: 3

Similar Papers

Applying continuous action reinforcement learning automata (CARLA) to global training of hidden Markov models
J Kabudian ... M.R Meybodi
01 Jan 2004

The application of continuous action reinforcement learning automata to adaptive PID tuning
M.N Howell
01 Jan 1999

Multivariable System Identification Method Based on Continuous Action Reinforcement Learning Automata
Meiying Jiang ... Qibing Jin
Processes | VOL. 7
17 Aug 2019

On-line PID tuning for engine idle-speed control using continuous action reinforcement learning automata
M.N Howell ... M.C Best
Control Engineering Practice | VOL. 8
17 Jan 2000
