Data-Driven Control of Unknown Systems: A Linear Programming Approach

Alexandros Tanzanakis,John Lygeros

doi:10.1016/j.ifacol.2020.12.027

Abstract

We consider the problem of discounted optimal state-feedback regulation for general unknown deterministic discrete-time systems. It is well known that open-loop instability of systems, non-quadratic cost functions and complex nonlinear dynamics, as well as the on-policy behavior of many reinforcement learning (RL) algorithms, make the design of model-free optimal adaptive controllers a challenging task. We depart from commonly used least-squares and neural network approximation methods in conventional model-free control theory, and propose a novel family of data-driven optimization algorithms based on linear programming, off-policy Q-learning and randomized experience replay. We develop both policy iteration (PI) and value iteration (VI) methods to compute an approximate optimal feedback controller with high precision and without the knowledge of a system model and stage cost function. Simulation studies confirm the effectiveness of the proposed methods.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IFAC PapersOnLine	Publication Date: Jan 1, 2020
Citations: 7	License type: cc-by-nc-nd

R Discovery Prime

R Discovery Prime

Data-Driven Control of Unknown Systems: A Linear Programming Approach

Abstract

Talk to us

Similar Papers

More From: IFAC PapersOnLine

Lead the way for us

Similar Papers

A Mixed Value and Policy Iteration Method for Stochastic Control with Universally Measurable Policies
Huizhen Yu ... Dimitri P Bertsekas
Mathematics of Operations Research | VOL. 40
Huizhen Yu, et. al.Huizhen Yu ... Dimitri P Bertsekas
01 Oct 2015
Mathematics of Operations Research | VOL. 40

Model-based and Model-free Optimal Control of Biomechanical SIP Model
Muhammad Haras ... Kamran Iqbal
-
Muhammad Haras, et. al.Muhammad Haras ... Kamran Iqbal
01 Sep 2022
01 Sep 2022

Approximate Dynamic Programming and Reinforcement Learning
Lucian Buşoniu ... Robert Babuška
-
Lucian Buşoniu, et. al.Lucian Buşoniu ... Robert Babuška
01 Jan 2009
01 Jan 2009

Randomised Procedures for Initialising and Switching Actions in Policy Iteration
Shivaram Kalyanakrishnan ... Neeldhara Misra
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 30
Shivaram Kalyanakrishnan, et. al.Shivaram Kalyanakrishnan ... Neeldhara Misra
05 Mar 2016
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 30

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Data-Driven Control of Unknown Systems: A Linear Programming Approach

Abstract

Talk to us

Similar Papers

More From: IFAC PapersOnLine