Abstract

This paper presents a novel adaptive dynamic programming (ADP) method for solving the optimal consensus problem for a class of discrete-time multi-agent systems with completely unknown dynamics. Unlike classical reinforcement-learning-based optimal control algorithms built on the one-step temporal difference method, the proposed multi-step (also called n-step) policy gradient ADP (MS-PGADP) algorithm, which has been shown to be more efficient owing to its faster propagation of rewards, is used to obtain the iterative control policies. Moreover, a novel Q-function is defined that estimates the performance of taking an action in the current state. Then, through the Lyapunov stability theorem and functional analysis, the optimality of the performance index function is proved and the stability of the error system is established. Furthermore, actor-critic neural networks are used to implement the proposed method. Inspired by the deep Q-network (DQN), a target network is also introduced to stabilize the neural networks during training. Finally, two simulations are conducted to verify the effectiveness of the proposed algorithm.
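Since the paper's equations are not reproduced here, the following is a minimal, hypothetical sketch of the two ingredients the abstract highlights: an n-step bootstrapped temporal-difference target, which propagates rewards over n transitions instead of one, and a DQN-style target network that is periodically synchronized to stabilize critic training. All names, parameters, and the toy linear critic are illustrative assumptions, not the paper's MS-PGADP implementation.

```python
import numpy as np

def n_step_target(rewards, bootstrap_q, gamma=0.95):
    """n-step return: r_t + gamma*r_{t+1} + ... + gamma^(n-1)*r_{t+n-1} + gamma^n * Q'(s_{t+n}, a_{t+n})."""
    n = len(rewards)
    discounted = sum(gamma**k * r for k, r in enumerate(rewards))
    return discounted + gamma**n * bootstrap_q

class LinearCritic:
    """Toy linear Q-function Q(s, a) = w^T phi(s, a), standing in for the critic neural network."""
    def __init__(self, dim, lr=0.01):
        self.w = np.zeros(dim)
        self.lr = lr

    def q(self, feats):
        return float(self.w @ feats)

    def td_update(self, feats, target):
        # One gradient step on the squared TD error (target - Q)^2.
        self.w += self.lr * (target - self.q(feats)) * feats

critic = LinearCritic(dim=4)
target_critic = LinearCritic(dim=4)

for step in range(1000):
    # ... collect an n-step trajectory segment (data collection omitted) ...
    rewards = [0.0, 1.0, 0.5]        # placeholder rewards over n = 3 steps
    feats_t = np.random.randn(4)     # placeholder features of (s_t, a_t)
    feats_tn = np.random.randn(4)    # placeholder features of (s_{t+n}, a_{t+n})

    # Bootstrap from the frozen target network, then update the critic.
    y = n_step_target(rewards, target_critic.q(feats_tn))
    critic.td_update(feats_t, y)

    # Periodic hard synchronization of the target network, as in DQN.
    if step % 100 == 0:
        target_critic.w = critic.w.copy()
```

In this sketch the multi-step target propagates reward information n transitions back per update, which is the faster-propagation property the abstract attributes to the n-step method, while the lagged copy of the critic keeps the bootstrap target fixed between synchronizations.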

