Abstract

To further understand the underlying mechanisms of various reinforcement learning (RL) algorithms, and to better exploit optimization theory to make further progress in RL, many researchers have begun to revisit the linear–quadratic regulator (LQR) problem, whose setting is simple yet captures the key characteristics of RL. Motivated by this, this work is concerned with the model-free design of stochastic LQR controllers for linear systems subject to Gaussian noise, from the perspective of primal–dual optimization. We first reformulate the stochastic LQR problem as a constrained non-convex optimization problem, which is shown to have strong duality. Then, to solve this non-convex optimization problem, we propose a model-based primal–dual (MB-PD) algorithm based on the properties of the resulting Karush–Kuhn–Tucker (KKT) conditions. We also give a model-free implementation of the MB-PD algorithm by solving a transformed dual feasibility condition. More importantly, we establish the connection between the proposed MB-PD algorithm and the classical policy iteration algorithm, which provides a novel primal–dual optimization perspective for understanding common RL algorithms. Finally, we provide a high-dimensional case study to show the performance of the proposed algorithms.
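For context, the classical policy iteration scheme that the abstract relates the MB-PD algorithm to is the Hewer-type gain iteration for LQR. Below is a minimal sketch of that classical iteration, not the paper's method: it assumes a discrete-time formulation with known dynamics (A, B), cost matrices (Q, R), and a stabilizing initial gain; all names are illustrative.

```python
# Sketch of classical policy iteration (Hewer's iteration) for discrete-time LQR.
# Assumptions (not from the paper): known (A, B, Q, R) and a stabilizing initial gain K0.
import numpy as np
from scipy.linalg import solve_discrete_lyapunov

def lqr_policy_iteration(A, B, Q, R, K0, iters=50, tol=1e-10):
    """Alternate policy evaluation (Lyapunov solve) and policy improvement."""
    K = K0
    for _ in range(iters):
        A_cl = A - B @ K  # closed-loop dynamics under the current gain K
        # Policy evaluation: solve P = A_cl^T P A_cl + Q + K^T R K
        P = solve_discrete_lyapunov(A_cl.T, Q + K.T @ R @ K)
        # Policy improvement: K_new = (R + B^T P B)^{-1} B^T P A
        K_new = np.linalg.solve(R + B.T @ P @ B, B.T @ P @ A)
        if np.linalg.norm(K_new - K) < tol:
            return K_new, P
        K = K_new
    return K, P

# Toy example: the open-loop system is stable, so K0 = 0 is stabilizing.
A = np.array([[0.9, 0.1], [0.0, 0.8]])
B = np.array([[0.0], [1.0]])
Q = np.eye(2)
R = np.eye(1)
K_opt, P_opt = lqr_policy_iteration(A, B, Q, R, np.zeros((1, 2)))
print("near-optimal gain K:\n", K_opt)
```

With additive Gaussian noise the optimal gain coincides with the deterministic LQR gain (certainty equivalence), so the same iteration applies; the paper's contribution is to recover and interpret such iterations through a primal–dual lens and to give a model-free implementation.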
