Low-complexity learning of Linear Quadratic Regulators from noisy data

Claudio De Persis,Pietro Tesi

doi:10.1016/j.automatica.2021.109548

Claudio De Persis, Pietro Tesi

Open Access

https://doi.org/10.1016/j.automatica.2021.109548

Copy DOI

Journal: Automatica	Publication Date: Mar 22, 2021
Citations: 75	License type: cc-by

Affiliation: University of Groningen, University of Florence

Abstract

This paper considers the Linear Quadratic Regulator problem for linear systems with unknown dynamics, a central problem in data-driven control and reinforcement learning. We propose a method that uses data to directly return a controller without estimating a model of the system. Sufficient conditions are given under which this method returns a stabilizing controller with guaranteed relative error when the data used to design the controller are affected by noise. This method has low complexity as it only requires a finite number of samples of the system response to a sufficiently exciting input, and can be efficiently implemented as a semi-definite programme.

Highlights

Control theory is witnessing an increasing renewed interest towards data-driven control
This paper considers the infinite horizon Linear Quadratic Regulator (LQR) problem for linear time-invariant systems, which is one of the problems more studied in the control literature
Where P is the controllability Gramian of the closed-loop system (5), which is the unique solution to (A + BK)P (A + BK)⊤ − P + I = 0 (7). This corresponds in the time domain to the 2-norm of the output z when impulses are applied to the input channels, and can be interpreted as the mean-square deviation of z when d is a white process with unit covariance, which is the classic stochastic LQR formulation

Summary

Introduction

Control theory is witnessing an increasing renewed interest towards data-driven (data-based) control. Starting from [Fiechter, 1997], a tremendous effort has been made for establishing non-asymptotic properties of data-driven methods This term refers to all those methods that aim at providing closedloop stability and performance guarantees using only a finite number of data points. A strength of our method (of direct methods in general) is a parsimonious use of such priors, which allows us to cope with situations where the noise has no convenient statistics In such situations indirect methods (at least those proposed for LQR) are instead much more difficult to pursue since the ID step is strongly reliant on such statistics [Mania et al, 2019, Dean et al, 2019]. This result states that a (noise-free) system trajectory generated by a persistently exciting input is a data-based non-parametric system model.

Notation and auxiliary facts

Problem definition and data-driven formulation

A data-driven SDP formulation

Data-driven solution with noisy data

Stability and performance analysis

Preliminary discussion

Noise robustness through soft constraints

Alternative based on the S-procedure

Stability and H2-norm bounds

Bounds on the relative error

Nonlinear systems

De-noising through averaging

Random linear systems

Nonlinear inverted pendulum

Concluding remarks

Findings

A Appendix

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Low-complexity learning of Linear Quadratic Regulators from noisy data

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Automatica

Lead the way for us

Similar Papers

Further results on the regulation problem for linear systems with constraints on control and its increment
A Abdelhak ... R Ou-Azzou
Mathematical Modeling and Computing | VOL. 10
A Abdelhak, et. al.A Abdelhak ... R Ou-Azzou
01 Jan 2023
Mathematical Modeling and Computing | VOL. 10

A Tour of Reinforcement Learning: The View from Continuous Control
Benjamin Recht
Annual Review of Control, Robotics, and Autonomous Systems | VOL. 2
Benjamin RechtBenjamin Recht
03 May 2019
Annual Review of Control, Robotics, and Autonomous Systems | VOL. 2

Direct policy search with extremum seeking
Megumi Miyashita ... Ryo Hirotani
-
Megumi Miyashita, et. al.Megumi Miyashita ... Ryo Hirotani
01 Sep 2017
01 Sep 2017

Analysis of an evolutionary reinforcement learning method in a multiagent domain
...
-
, et. al. ...
12 May 2008
12 May 2008

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Low-complexity learning of Linear Quadratic Regulators from noisy data

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Automatica