Abstract

Continuous-time reinforcement learning (CT-RL) methods hold great promise for real-world applications. Adaptive dynamic programming (ADP)-based CT-RL algorithms, especially in their theoretical developments, have achieved great success. However, these methods have not been demonstrated on realistic or meaningful learning control problems. The goal of this work is therefore to introduce a suite of new excitable integral reinforcement learning (EIRL) algorithms for control of CT affine nonlinear systems. This work develops a new excitation framework that improves persistence of excitation (PE) and numerical performance via input/output insights from classical control. Furthermore, when the system dynamics afford a physically-motivated partition into distinct dynamical loops, the proposed methods break the control problem into smaller subproblems, resulting in reduced complexity. By leveraging the known affine nonlinear dynamics, the methods achieve well-behaved system responses and considerable data efficiency. The work provides convergence, solution optimality, and closed-loop stability guarantees for the proposed methods, and it demonstrates these guarantees on a significant application problem: control of an unstable, nonminimum-phase hypersonic vehicle (HSV).
