Unified reinforcement Q-learning for mean field game and control problems

Andrea Angiuli,Jean-Pierre Fouque,Mathieu Laurière

doi:10.1007/s00498-021-00310-1

Abstract

We present a Reinforcement Learning (RL) algorithm to solve infinite horizon asymptotic Mean Field Game (MFG) and Mean Field Control (MFC) problems. Our approach can be described as a unified two-timescale Mean Field Q-learning: The same algorithm can learn either the MFG or the MFC solution by simply tuning the ratio of two learning parameters. The algorithm is in discrete time and space where the agent not only provides an action to the environment but also a distribution of the state in order to take into account the mean field feature of the problem. Importantly, we assume that the agent cannot observe the population’s distribution and needs to estimate it in a model-free manner. The asymptotic MFG and MFC problems are also presented in continuous time and space, and compared with classical (non-asymptotic or stationary) MFG and MFC problems. They lead to explicit solutions in the linear-quadratic (LQ) case that are used as benchmarks for the results of our algorithm.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Unified reinforcement Q-learning for mean field game and control problems

Abstract

Talk to us

Similar Papers

More From: Mathematics of Control, Signals, and Systems

Lead the way for us

Journal: Mathematics of Control, Signals, and Systems	Publication Date: Jan 15, 2022
Citations: 16

Similar Papers

Deep learning for mean field optimal transport
Sebastian Baudelet ... Yuchen Zhu
ESAIM: Proceedings and Surveys | VOL. 77
Sebastian Baudelet, et. al.Sebastian Baudelet ... Yuchen Zhu
01 Jan 2024
ESAIM: Proceedings and Surveys | VOL. 77

Convergence analysis of machine learning algorithms for the numerical solution of mean field control and games: II—the finite horizon case
René Carmona ... Mathieu Laurière
The Annals of Applied Probability | VOL. 32
René Carmona, et. al.René Carmona ... Mathieu Laurière
01 Dec 2022
The Annals of Applied Probability | VOL. 32

A machine learning framework for solving high-dimensional mean field game and mean field control problems
Lars Ruthotto ... Samy Wu Fung
Proceedings of the National Academy of Sciences | VOL. 117
Lars Ruthotto, et. al.Lars Ruthotto ... Samy Wu Fung
09 Apr 2020
Proceedings of the National Academy of Sciences | VOL. 117

Numerical methods for mean field games and mean field type control
Mathieu Laurière
-
Mathieu LaurièreMathieu Laurière
01 Jan 2020
01 Jan 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Unified reinforcement Q-learning for mean field game and control problems

Abstract

Talk to us

Similar Papers

More From: Mathematics of Control, Signals, and Systems