A Tour of Reinforcement Learning: The View from Continuous Control

Benjamin Recht

doi:10.1146/annurev-control-053018-023825

Abstract

This article surveys reinforcement learning from the perspective of optimization and control, with a focus on continuous control applications. It reviews the general formulation, terminology, and typical experimental implementations of reinforcement learning as well as competing solution paradigms. In order to compare the relative merits of various techniques, it presents a case study of the linear quadratic regulator (LQR) with unknown dynamics, perhaps the simplest and best-studied problem in optimal control. It also describes how merging techniques from learning theory and control can provide nonasymptotic characterizations of LQR performance and shows that these characterizations tend to match experimental behavior. In turn, when revisiting more complex applications, many of the observed phenomena in LQR persist. In particular, theory and experiment demonstrate the role and importance of models and the cost of generality in reinforcement learning algorithms. The article concludes with a discussion of some of the challenges in designing learning systems that safely and reliably interact with complex and uncertain environments and how tools from reinforcement learning and control might be combined to approach these challenges.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A Tour of Reinforcement Learning: The View from Continuous Control

Abstract

Talk to us

Similar Papers

More From: Annual Review of Control, Robotics, and Autonomous Systems

Lead the way for us

Journal: Annual Review of Control, Robotics, and Autonomous Systems	Publication Date: May 3, 2019
Citations: 470

Similar Papers

Model-Free Reinforcement Learning of Minimal-Cost Variance Control
Gangshan Jing ... He Bai
IEEE Control Systems Letters | VOL. 4
Gangshan Jing, et. al.Gangshan Jing ... He Bai
01 Oct 2020
IEEE Control Systems Letters | VOL. 4

Deep Deterministic Policy Gradient to Regulate Feedback Control Systems Using Reinforcement Learning
Samir Salem Al-Bawri ... Dominique M M P Schreurs
Computers, Materials & Continua | VOL. 71
Samir Salem Al-Bawri, et. al.Samir Salem Al-Bawri ... Dominique M M P Schreurs
01 Jan 2021
Computers, Materials & Continua | VOL. 71

Optimal Trajectory Tracking for Cyber-Physical Marine Vessel: Reinforcement Learning Approach
Xiaolei Li ... Jiange Wang
-
Xiaolei Li, et. al.Xiaolei Li ... Jiange Wang
13 Dec 2020
13 Dec 2020

Off-Policy Reinforcement Learning for Optimal Preview Tracking Control of Linear Discrete-Time systems with unknown dynamics
Chao-Ran Wang ... Huai-Ning Wu
-
Chao-Ran Wang, et. al.Chao-Ran Wang ... Huai-Ning Wu
01 Nov 2018
01 Nov 2018

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Tour of Reinforcement Learning: The View from Continuous Control

Abstract

Talk to us

Similar Papers

More From: Annual Review of Control, Robotics, and Autonomous Systems