Evolutionary Dynamics of Q-Learning over the Sequence Form

Fabio Panozzo,Marcello Restelli,Nicola Gatti

doi:10.1609/aaai.v28i1.9012

Abstract

Multi-agent learning is a challenging open task in artificial intelligence. It is known an interesting connection between multi-agent learning algorithms and evolutionary game theory, showing that the learning dynamics of some algorithms can be modeled as replicator dynamics with a mutation term. Inspired by the recent sequence-form replicator dynamics, we develop a new version of the Q-learning algorithm working on the sequence form of an extensive-form game allowing thus an exponential reduction of the dynamics length w.r.t. those of the normal form. The dynamics of the proposed algorithm can be modeled by using the sequence-form replicator dynamics with a mutation term. We show that, although sequence-form and normal-form replicator dynamics are realization equivalent, the Q-learning algorithm applied to the two forms have non-realization equivalent dynamics. Originally from the previous works on evolutionary game theory models form multi-agent learning, we produce an experimental evaluation to show the accuracy of the model.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Evolutionary Dynamics of Q-Learning over the Sequence Form

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence

Lead the way for us

Journal: Proceedings of the AAAI Conference on Artificial Intelligence	Publication Date: Jun 21, 2014
Citations: 6

Similar Papers

Just add Pepper: extending learning algorithms for repeated matrix games to repeated Markov games

-

04 Jun 2012
04 Jun 2012

Exponential moving average based multiagent reinforcement learning algorithms
Mostafa D Awheda ... Howard M Schwartz
Artificial Intelligence Review | VOL. 45
Mostafa D Awheda, et. al.Mostafa D Awheda ... Howard M Schwartz
19 Oct 2015
Artificial Intelligence Review | VOL. 45

Best-Response Multiagent Learning in Non-Stationary Environments
...
-
, et. al. ...
19 Jul 2004
19 Jul 2004

A common gradient in multi-agent reinforcement learning
...
-
, et. al. ...
04 Jun 2012
04 Jun 2012

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Evolutionary Dynamics of Q-Learning over the Sequence Form

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence