Q‐learning for continuous‐time graphical games on large networks with completely unknown linear system dynamics

Kyriakos G Vamvoudakis

doi:10.1002/rnc.3719

Abstract

In this paper, we consider the problem of leader synchronization in systems of interacting agents on large networks while simultaneously satisfying energy‐related, user‐defined distributed optimization criteria. Because modeling large networks is very difficult, we derive a model‐free formulation based on a separate distributed Q‐learning function for every agent. Every Q‐function is parametrized by the agent's own control, the neighborhood controls, and the neighborhood tracking error. None of the agents has any information about where the leader is connected to the network or from where the leader spreads the desired information. The proposed algorithm uses an integral reinforcement learning approach with a separate distributed actor/critic network for each agent: a critic approximator to approximate each value function and an actor approximator to approximate each optimal control law. The tuning laws for each actor and critic approximator are derived using gradient descent. We provide rigorous stability and convergence proofs to show that the closed‐loop system has an asymptotically stable equilibrium point and that the control policies form a graphical Nash equilibrium. We demonstrate the effectiveness of the proposed method on a network consisting of 10 agents. Copyright © 2016 John Wiley & Sons, Ltd.
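The abstract does not define the neighborhood tracking error, so the following is only a minimal sketch that assumes the definition commonly used in cooperative control, delta_i = sum_j a_ij (x_i - x_j) + g_i (x_i - x_0). The function name, the adjacency weights a_ij, and the pinning gains g_i are illustrative assumptions and are not taken from the paper.

```python
from typing import List

Vector = List[float]


def neighborhood_tracking_error(
    i: int,
    states: List[Vector],
    leader_state: Vector,
    adjacency: List[List[float]],
    pinning: List[float],
) -> Vector:
    """Assumed form: delta_i = sum_j a_ij (x_i - x_j) + g_i (x_i - x_0).

    adjacency[i][j] is the (hypothetical) weight agent i puts on agent j;
    pinning[i] is nonzero only if agent i is connected to the leader.
    """
    x_i = states[i]
    delta = [0.0] * len(x_i)
    # Disagreement with each neighbor, weighted by the edge weight.
    for j, a_ij in enumerate(adjacency[i]):
        if a_ij == 0.0:
            continue
        for k in range(len(x_i)):
            delta[k] += a_ij * (x_i[k] - states[j][k])
    # Disagreement with the leader, only for pinned agents.
    g_i = pinning[i]
    for k in range(len(x_i)):
        delta[k] += g_i * (x_i[k] - leader_state[k])
    return delta


# Three scalar agents in a chain: leader -> agent 0 -> agent 1 -> agent 2.
states = [[1.0], [2.0], [4.0]]
leader = [0.0]
adjacency = [
    [0.0, 0.0, 0.0],  # agent 0 has no agent neighbors
    [1.0, 0.0, 0.0],  # agent 1 listens to agent 0
    [0.0, 1.0, 0.0],  # agent 2 listens to agent 1
]
pinning = [1.0, 0.0, 0.0]  # only agent 0 is connected to the leader

errors = [
    neighborhood_tracking_error(i, states, leader, adjacency, pinning)
    for i in range(3)
]
```

In this sketch each agent uses only its own state, its neighbors' states, and (if pinned) the leader's state, which is the kind of local information a distributed scheme relies on. All errors are zero when every agent matches the leader.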

Journal: International Journal of Robust and Nonlinear Control
Publication Date: Nov 30, 2016
Citations: 28

Similar Papers

Game-theoretic tracking control for actuator attack attenuation in cyber-physical systems
Kyriakos G Vamvoudakis
01 Jul 2016

Optimal trajectory Output Tracking control with a Q-learning algorithm
Kyriakos G Vamvoudakis
01 Jul 2016

Value iteration based integral reinforcement learning approach for H∞ controller design of continuous-time nonlinear systems
Geyang Xiao ... Kun Zhang
Neurocomputing | VOL. 285
17 Feb 2018

Non-zero sum Nash Q-learning for unknown deterministic continuous-time linear systems
Kyriakos G Vamvoudakis
Automatica | VOL. 61
05 Sep 2015
