Robust Reinforcement Learning: A Case Study in Linear Quadratic Regulation

Bo Pang, Zhong-Ping Jiang

doi:10.1609/aaai.v35i10.17122

Abstract

This paper studies the robustness of reinforcement learning algorithms to errors in the learning process. Specifically, we revisit the benchmark problem of discrete-time linear quadratic regulation (LQR) and study the long-standing open question: Under what conditions is the policy iteration method robustly stable from a dynamical systems perspective? Using advanced stability results in control theory, it is shown that policy iteration for LQR is inherently robust to small errors in the learning process and enjoys small-disturbance input-to-state stability: whenever the error in each iteration is bounded and small, the solutions of the policy iteration algorithm are also bounded and, moreover, enter and stay in a small neighborhood of the optimal LQR solution. As an application, a novel off-policy optimistic least-squares policy iteration for the LQR problem is proposed for the case in which the system dynamics are subject to additive stochastic disturbances. The new results in robust reinforcement learning are validated by a numerical example.
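The robustness claim in the abstract can be illustrated with a minimal sketch, which is not the paper's off-policy least-squares algorithm. It runs plain policy iteration on a scalar discrete-time LQR problem (x_next = a*x + b*u, stage cost q*x^2 + r*u^2, policy u = -k*x) and perturbs each policy-evaluation result by a bounded error. The system values, the uniform error model, and all function names are assumptions chosen for illustration.

```python
import random


def evaluate_policy(a, b, q, r, k):
    """Policy evaluation: cost p of the gain k (u = -k*x), i.e. the solution of
    the scalar Lyapunov equation p = q + r*k**2 + (a - b*k)**2 * p."""
    a_cl = a - b * k  # closed-loop dynamics
    if abs(a_cl) >= 1.0:
        raise ValueError("gain k is not stabilizing")
    return (q + r * k * k) / (1.0 - a_cl * a_cl)


def improve_policy(a, b, r, p):
    """Policy improvement: gain that is greedy with respect to the cost p."""
    return a * b * p / (r + b * b * p)


def solve_riccati(a, b, q, r, iters=1000):
    """Reference optimum via fixed-point iteration on the scalar Riccati equation."""
    p = q
    for _ in range(iters):
        p = q + a * a * p - (a * b * p) ** 2 / (r + b * b * p)
    return p


def inexact_policy_iteration(a, b, q, r, k0, error_bound, iters=30, seed=0):
    """Policy iteration in which each evaluated cost is corrupted by an error
    drawn uniformly from [-error_bound, error_bound] (assumed error model).
    Returns the true cost of the gain used at every iteration."""
    rng = random.Random(seed)
    k = k0
    true_costs = []
    for _ in range(iters):
        p = evaluate_policy(a, b, q, r, k)
        true_costs.append(p)
        p_hat = p + rng.uniform(-error_bound, error_bound)  # inexact evaluation
        k = improve_policy(a, b, r, p_hat)
    return true_costs


if __name__ == "__main__":
    a, b, q, r = 1.2, 1.0, 1.0, 1.0  # hypothetical unstable open-loop system
    p_star = solve_riccati(a, b, q, r)
    for bound in (0.0, 0.01, 0.1):
        costs = inexact_policy_iteration(a, b, q, r, k0=1.0, error_bound=bound)
        print(f"error bound {bound}: final gap to optimum {costs[-1] - p_star:.2e}")
```

In this sketch the iterates stay bounded, and the final gap to the optimal cost shrinks as the error bound shrinks, which is the qualitative behavior that small-disturbance input-to-state stability describes. The scalar setting only illustrates the idea; the paper's analysis concerns the general matrix case.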

Journal: Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence
Publication Date: May 18, 2021
Citations: 14
