Relaxed Actor-Critic With Convergence Guarantees for Continuous-Time Optimal Control of Nonlinear Systems

Jingliang Duan,Qiang Ge,Fei Ma,Jie Li,Shengbo Eben Li,Dezhao Zhang,Monimoy Bujarbaruah

doi:10.1109/tiv.2023.3255264

Abstract

This paper presents the Relaxed Continuous-Time Actor-critic (RCTAC) algorithm, a method for finding the nearly optimal policy for nonlinear continuous-time (CT) systems with known dynamics and infinite horizon, such as the path-tracking control of vehicles. RCTAC has several advantages over existing adaptive dynamic programming algorithms for CT systems. It does not require the ``admissibility" of the initialized policy or the input-affine nature of controlled systems for convergence. Instead, given any initial policy, RCTAC can converge to an admissible, and subsequently nearly optimal policy for a general nonlinear system with a saturated controller. RCTAC consists of two phases: a warm-up phase and a generalized policy iteration phase. The warm-up phase minimizes the square of the Hamiltonian to achieve admissibility, while the generalized policy iteration phase relaxes the update termination conditions for faster convergence. The convergence and optimality of the algorithm are proven through Lyapunov analysis, and its effectiveness is demonstrated through simulations and real-world path-tracking tasks.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Relaxed Actor-Critic With Convergence Guarantees for Continuous-Time Optimal Control of Nonlinear Systems

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Intelligent Vehicles

Lead the way for us

Journal: IEEE Transactions on Intelligent Vehicles	Publication Date: May 1, 2023
Citations: 8

Similar Papers

Data-based robust optimal control of continuous-time affine nonlinear systems with matched uncertainties
Ding Wang ... Chaoxu Mu
Information Sciences | VOL. 366
Ding Wang, et. al.Ding Wang ... Chaoxu Mu
27 May 2016
Information Sciences | VOL. 366

Knowledge-Data Driven Optimal Control for Nonlinear Systems and Its Application to Wastewater Treatment Process.
Honggui Han ... Junfei Qiao
IEEE transactions on cybernetics | VOL. 54
Honggui Han, et. al.Honggui Han ... Junfei Qiao
01 Oct 2024
IEEE transactions on cybernetics | VOL. 54

Kernel-Based Adaptive Critic Designs for Optimal Control of Nonlinear Discrete-Time System
Fuxiao Tan ... Xinping Guan
-
Fuxiao Tan, et. al.Fuxiao Tan ... Xinping Guan
01 Jul 2018
01 Jul 2018

Self-triggering adaptive optimal control for nonlinear systems based on encoding mechanism
Xuyang Lou ... Zheng Ji
Mathematics and Computers in Simulation | VOL. 190
Xuyang Lou, et. al.Xuyang Lou ... Zheng Ji
06 Jul 2021
Mathematics and Computers in Simulation | VOL. 190

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Relaxed Actor-Critic With Convergence Guarantees for Continuous-Time Optimal Control of Nonlinear Systems

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Intelligent Vehicles