Abstract

In this paper, reinforcement learning (RL) is employed to find a causal solution to the linear quadratic tracker (LQT) for continuous-time systems online in real time. Although several RL techniques have been developed in the literature to solve the LQ regulator, to our knowledge there is no rigorous result for using RL to solve the LQ tracker. This is mainly because the tracker control requires a feedforward term that must normally be computed noncausally, backward in time. To deal with this noncausality problem, an augmented system composed of the original system and the command generator dynamics is constructed, and an augmented LQT algebraic Riccati equation is derived for solving the LQT problem. In this formulation, RL techniques can be applied to solve the LQT problem, computing the feedforward and feedback terms simultaneously online in real time. The convergence of the proposed online algorithms to the optimal control solution is verified. To show the efficiency of the proposed approach, a simulation example is provided.
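The augmented formulation described above can be illustrated offline. The sketch below, a hypothetical minimal example (scalar plant, constant reference, discount factor, and all numerical values are assumptions, not taken from the paper), builds the augmented system from the plant and command generator dynamics and solves the discounted augmented LQT Riccati equation with a standard CARE solver. The paper's contribution is to find this same solution online via RL without such model knowledge; here the model-based solution only shows how one gain over the augmented state yields the feedback and feedforward terms together.

```python
import numpy as np
from scipy.linalg import solve_continuous_are

# Hypothetical plant: x' = A x + B u, output y = C x,
# tracking a reference generated by r' = F r (here a constant, F = 0).
A = np.array([[-1.0]])
B = np.array([[1.0]])
C = np.array([[1.0]])
F = np.array([[0.0]])

# Augmented state X = [x; r]:  X' = T X + B1 u
T = np.block([[A, np.zeros((1, 1))],
              [np.zeros((1, 1)), F]])
B1 = np.vstack([B, np.zeros((1, 1))])

# Tracking error e = C x - r = C1 X; quadratic weights Q, R.
C1 = np.hstack([C, -np.eye(1)])
Q = np.array([[1.0]])
R = np.array([[1.0]])
Q1 = C1.T @ Q @ C1

# Discounted augmented LQT ARE:
#   T'P + P T - gamma P + Q1 - P B1 R^{-1} B1' P = 0.
# Shifting T by gamma/2 turns it into a standard CARE, which also makes
# the uncontrollable reference mode stable so the solver succeeds.
gamma = 0.1
P = solve_continuous_are(T - 0.5 * gamma * np.eye(2), B1, Q1, R)

# A single gain over [x; r]: its first column is the feedback term,
# its second column the (causal) feedforward term.
K = np.linalg.solve(R, B1.T @ P)
print(K)
```

An online RL scheme would instead estimate P (or K) from measured trajectories of the augmented state, converging to the same gain without requiring A.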
