Safe reinforcement learning for dynamical games

Yongliang Yang,Hamidreza Modares,Kyriakos G Vamvoudakis

doi:10.1002/rnc.4962

Abstract

SummaryThis article presents a novel actor‐critic‐barrier structure for the multiplayer safety‐critical systems. Non‐zero‐sum (NZS) games with full‐state constraints are first transformed into unconstrained NZS games using a barrier function. The barrier function is capable of dealing with both symmetric and asymmetric constraints on the state. It is shown that the Nash equilibrium of the unconstrained NZS guarantees to stabilize the original multiplayer system. The barrier function is combined with an actor‐critic structure to learn the Nash equilibrium solution in an online fashion. It is shown that integrating the barrier function with the actor‐critic structure guarantees that the constraints will not be violated during learning. Boundedness and stability of the closed‐loop signals are analyzed. The efficacy of the presented approach is finally demonstrated by using a simulation example.

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: International journal of robust and nonlinear control	Publication Date: Mar 25, 2020
Citations: 68	License type: publisher-specific, author manuscript

R Discovery Prime

R Discovery Prime

Safe reinforcement learning for dynamical games

Abstract

Talk to us

Similar Papers

More From: International journal of robust and nonlinear control

Lead the way for us

Similar Papers

Three-Dimensional Reachability Set For a Dubins Car: Reduction of the General Case of Rotation Constraints to the Canonical Case
В С Пацко ... А А Федотов
Известия Российской академии наук. Теория и системы управления | VOL. -
В С Пацко, et. al.В С Пацко ... А А Федотов
01 Jul 2023
Известия Российской академии наук. Теория и системы управления | VOL. -

Multi-player non-zero-sum games: Online adaptive learning solution of coupled Hamilton–Jacobi equations
Kyriakos G Vamvoudakis ... Frank L Lewis
Automatica | VOL. 47
Kyriakos G Vamvoudakis, et. al.Kyriakos G Vamvoudakis ... Frank L Lewis
27 Mar 2011
Automatica | VOL. 47

Safe adaptive learning algorithm with neural network implementation for H∞ control of nonlinear safety‐critical system
Chunbin Qin ... Qiyang Xiao
International journal of robust and nonlinear control | VOL. 33
Chunbin Qin, et. al.Chunbin Qin ... Qiyang Xiao
31 Oct 2022
International journal of robust and nonlinear control | VOL. 33

District cooling system control for providing regulation services based on safe reinforcement learning with barrier functions
Peipei Yu ... Yonghua Song
Applied energy | VOL. 347
Peipei Yu, et. al.Peipei Yu ... Yonghua Song
16 Jun 2023
Applied energy | VOL. 347

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Safe reinforcement learning for dynamical games

Abstract

Talk to us

Similar Papers

More From: International journal of robust and nonlinear control