Output Feedback Q-Learning Control for the Discrete-Time Linear Quadratic Regulator Problem.

Syed Ali Asad Rizvi,Zongli Lin

doi:10.1109/tnnls.2018.2870075

Abstract

Approximate dynamic programming (ADP) and reinforcement learning (RL) have emerged as important tools in the design of optimal and adaptive control systems. Most of the existing RL and ADP methods make use of full-state feedback, a requirement that is often difficult to satisfy in practical applications. As a result, output feedback methods are more desirable as they relax this requirement. In this paper, we present a new output feedback-based Q-learning approach to solving the linear quadratic regulation (LQR) control problem for discrete-time systems. The proposed scheme is completely online in nature and works without requiring the system dynamics information. More specifically, a new representation of the LQR Q-function is developed in terms of the input-output data. Based on this new Q-function representation, output feedback LQR controllers are designed. We present two output feedback iterative Q-learning algorithms based on the policy iteration and the value iteration methods. This scheme has the advantage that it does not incur any excitation noise bias, and therefore, the need of using discounted cost functions is circumvented, which in turn ensures closed-loop stability. It is shown that the proposed algorithms converge to the solution of the LQR Riccati equation. A comprehensive simulation study is carried out, which illustrates the proposed scheme.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Output Feedback Q-Learning Control for the Discrete-Time Linear Quadratic Regulator Problem.

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Neural Networks and Learning Systems

Lead the way for us

Journal: IEEE Transactions on Neural Networks and Learning Systems	Publication Date: Oct 8, 2018
Citations: 95

Similar Papers

Intelligent Linear-Quadratic Optimal Output Feedback Regulator for a Deregulated Automatic Generation Control System
Elyas Rakhshani
Electric Power Components and Systems | VOL. 40
Elyas RakhshaniElyas Rakhshani
01 Mar 2012
Electric Power Components and Systems | VOL. 40

Reinforcement Learning-Based Linear Quadratic Regulation of Continuous-Time Systems Using Dynamic Output Feedback.
Syed Ali Asad Rizvi ... Zongli Lin
IEEE Transactions on Cybernetics | VOL. 50
Syed Ali Asad Rizvi, et. al.Syed Ali Asad Rizvi ... Zongli Lin
03 Jan 2019
IEEE Transactions on Cybernetics | VOL. 50

LQR Control for Homogeneous Agents with Multi-graph Topology
Dong-Mei Zhang ... Lin-Lin Ou
Acta Automatica Sinica | VOL. 39
Dong-Mei Zhang, et. al.Dong-Mei Zhang ... Lin-Lin Ou
25 Mar 2014
Acta Automatica Sinica | VOL. 39

On a general optimal algorithm for multirate output feedback controllers for linear stochastic periodic systems
Nie-Zen Yen ... Yung-Chun Wu
IEEE Transactions on Automatic Control | VOL. 38
Nie-Zen Yen, et. al. Nie-Zen Yen ... Yung-Chun Wu
01 Jun 1993
IEEE Transactions on Automatic Control | VOL. 38

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Output Feedback Q-Learning Control for the Discrete-Time Linear Quadratic Regulator Problem.

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Neural Networks and Learning Systems