Actor-critic-based optimal tracking for partially unknown nonlinear discrete-time systems.

Bahare Kiumarsi,Frank L Lewis

doi:10.1109/tnnls.2014.2358227

Abstract

This paper presents a partially model-free adaptive optimal control solution to the deterministic nonlinear discrete-time (DT) tracking control problem in the presence of input constraints. The tracking error dynamics and reference trajectory dynamics are first combined to form an augmented system. Then, a new discounted performance function based on the augmented system is presented for the optimal nonlinear tracking problem. In contrast to the standard solution, which finds the feedforward and feedback terms of the control input separately, the minimization of the proposed discounted performance function gives both feedback and feedforward parts of the control input simultaneously. This enables us to encode the input constraints into the optimization problem using a nonquadratic performance function. The DT tracking Bellman equation and tracking Hamilton-Jacobi-Bellman (HJB) are derived. An actor-critic-based reinforcement learning algorithm is used to learn the solution to the tracking HJB equation online without requiring knowledge of the system drift dynamics. That is, two neural networks (NNs), namely, actor NN and critic NN, are tuned online and simultaneously to generate the optimal bounded control policy. A simulation example is given to show the effectiveness of the proposed method.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Actor-critic-based optimal tracking for partially unknown nonlinear discrete-time systems.

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Neural Networks and Learning Systems

Lead the way for us

Journal: IEEE Transactions on Neural Networks and Learning Systems	Publication Date: Oct 8, 2014
Citations: 275

Similar Papers

Decentralized Adaptive Optimal Tracking Control for Massive Multi-agent Systems: An Actor-Critic-Mass Algorithm
Zejian Zhou ... Hao Xu
-
Zejian Zhou, et. al.Zejian Zhou ... Hao Xu
01 Dec 2019
01 Dec 2019

Decentralized Adaptive Tracking Control For Large-Scale Multi-Agent Systems Under Unstructured Environment
Shawon Dey ... Hao Xu
-
Shawon Dey, et. al.Shawon Dey ... Hao Xu
04 Dec 2022
04 Dec 2022

Optimal control of affine nonlinear discrete-time systems
Travis Dierks ... S Jagannthan
-
Travis Dierks, et. al.Travis Dierks ... S Jagannthan
01 Jun 2009
01 Jun 2009

Adaptive Identifier-Critic-Based Optimal Tracking Control for Nonlinear Systems With Experimental Validation
Jing Na ... Jun Zhao
IEEE Transactions on Systems, Man, and Cybernetics: Systems | VOL. 52
Jing Na, et. al.Jing Na ... Jun Zhao
02 Jul 2020
IEEE Transactions on Systems, Man, and Cybernetics: Systems | VOL. 52

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Actor-critic-based optimal tracking for partially unknown nonlinear discrete-time systems.

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Neural Networks and Learning Systems