Approximate Dynamic Programming with Gaussian Processes for Optimal Control of Continuous-Time Nonlinear Systems

Hirofumi Beppu, Ichiro Maruta, Kenji Fujimoto

doi:10.1016/j.ifacol.2020.12.098

Abstract

This paper proposes a new algorithm that realizes approximate dynamic programming (ADP) with Gaussian processes (GPs) for infinite-horizon optimal control of continuous-time (CT) nonlinear input-affine systems. Convergence of the ADP algorithm is proven under the assumption of an exact approximation: both the cost function and the control input converge to their optimal values, that is, to the solution of the Hamilton-Jacobi-Bellman (HJB) equation. In almost every application, however, approximation errors are unavoidable. To address this problem, the proposed algorithm is derived together with a proof of convergence showing that the approximated cost function and control input converge to those of the ADP as the number of data points for the GPs approaches infinity. A numerical simulation demonstrates the effectiveness of the proposed algorithm.
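For orientation, the objects named in the abstract can be written out in their usual textbook form. The block below is a sketch under stated assumptions, not the paper's own formulation: the symbols f, g, q, R, V and u, the running cost that is quadratic in the input, and the policy-iteration form of ADP are all assumed here, and the paper's notation and algorithm may differ.

```latex
% Assumed setting: input-affine dynamics and an infinite-horizon cost
% (symbols f, g, q, R are illustrative, not taken from the paper).
\begin{align}
  \dot{x}(t) &= f(x(t)) + g(x(t))\,u(t), \\
  J(x_0, u)  &= \int_0^{\infty} \bigl( q(x(t)) + u(t)^{\top} R\, u(t) \bigr)\, dt,
  \qquad q \text{ positive definite},\; R \succ 0.
\end{align}

% HJB equation for the optimal cost V^* and the resulting optimal input.
\begin{align}
  0 &= \min_{u} \Bigl[ q(x) + u^{\top} R\, u
        + \nabla V^{*}(x)^{\top} \bigl( f(x) + g(x)\,u \bigr) \Bigr], \\
  u^{*}(x) &= -\tfrac{1}{2}\, R^{-1} g(x)^{\top} \nabla V^{*}(x), \\
  0 &= q(x) + \nabla V^{*}(x)^{\top} f(x)
        - \tfrac{1}{4}\, \nabla V^{*}(x)^{\top} g(x) R^{-1} g(x)^{\top} \nabla V^{*}(x).
\end{align}

% Textbook policy iteration, one common form of ADP: starting from an
% admissible input u_0, alternate policy evaluation and policy improvement.
\begin{align}
  0 &= q(x) + u_i(x)^{\top} R\, u_i(x)
        + \nabla V_i(x)^{\top} \bigl( f(x) + g(x)\,u_i(x) \bigr),
  \qquad V_i(0) = 0, \\
  u_{i+1}(x) &= -\tfrac{1}{2}\, R^{-1} g(x)^{\top} \nabla V_i(x).
\end{align}
```

In this reading, a GP would stand in for V_i (and hence its gradient) at a finite set of data points, which is where the approximation error discussed in the abstract enters.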

Journal: IFAC PapersOnLine
Publication Date: Jan 1, 2020
Citations: 1

Similar Papers

Optimal adaptive control of nonlinear continuous-time systems in strict feedback form with unknown internal dynamics
H Zargarzadeh ... S Jagannathan
01 Dec 2012

Robust and Optimal Guaranteed Cost Control of Continuous-Time Nonlinear Systems
Derong Liu ... Hongliang Li
01 Jan 2017

Optimal control of unknown affine nonlinear discrete-time systems using offline-trained neural networks with proof of convergence
Travis Dierks ... Balaje T Thumati
Neural Networks | VOL. 22
01 Jul 2009

Online optimal control of nonlinear discrete-time systems using approximate dynamic programming
Travis Dierks ... Sarangapani Jagannathan
Journal of Control Theory and Applications | VOL. 9
19 Jul 2011
