Abstract

This study is dedicated to develop an adaptive output-feedback (OPFB) tracking control scheme for continuous-time linear systems to achieve the infinite-horizon linear quadratic tracking (LQT) solution. The existence conditions of the OPFB control for LQT solution are proposed and an upper bound is found for the discount factor to assure the stability of the OPFB solution. To develop an online learning solution without knowing the system drift dynamics, a novel value iteration (VI) algorithm is presented based on the integral reinforcement learning technique which requires measured augmented system states. Moreover, a convergence analysis is proposed for the VI algorithm. Compared to the policy iteration method, the VI algorithm relaxes the initial stabilising control policy requirement. Specifically, in order to further obviate the knowledge of system states, a neural network-based adaptive observer is used during learning and it is no longer needed after the online learning algorithm converges. Effectiveness of the proposed learning scheme is illustrated through an interesting application into a single-phase grid-connected PV power inverter.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call