Abstract
This study is dedicated to develop an adaptive output-feedback (OPFB) tracking control scheme for continuous-time linear systems to achieve the infinite-horizon linear quadratic tracking (LQT) solution. The existence conditions of the OPFB control for LQT solution are proposed and an upper bound is found for the discount factor to assure the stability of the OPFB solution. To develop an online learning solution without knowing the system drift dynamics, a novel value iteration (VI) algorithm is presented based on the integral reinforcement learning technique which requires measured augmented system states. Moreover, a convergence analysis is proposed for the VI algorithm. Compared to the policy iteration method, the VI algorithm relaxes the initial stabilising control policy requirement. Specifically, in order to further obviate the knowledge of system states, a neural network-based adaptive observer is used during learning and it is no longer needed after the online learning algorithm converges. Effectiveness of the proposed learning scheme is illustrated through an interesting application into a single-phase grid-connected PV power inverter.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.