Constrained and Stabilizing Stacked Adaptive Dynamic Programming and a Comparison with Model Predictive Control

Lukas Beckenbach,Thomas Gohrt,Stefan Streif,Pavel Osinenko

doi:10.23919/ecc.2018.8550545

Abstract

Model predictive control (MPC) is in many applications the de facto approach to optimal control. It typically provides an optimal input (sequence) for a finite-horizon of given running costs. Another approach, called dynamic programming (DP), is based on the Hamilton-Jacobi-Bellman formalism and usually seeks optimal inputs over an infinite horizon of running costs. Unlike MPC, DP is much less computationally tractable and typically requires state space discretization which leads to the so-called curse of dimensionality. Adaptive dynamic programming (ADP), an approach based on reinforcement learning, seeks to address the difficulties of DP by introducing approximation models for the optimal cost function and control policies. In a variant of ADP called stacked ADP (sADP), control policies are optimized over a finite stack of value function approximants, thus making it somewhat similar to MPC. First, similarities and differences between a variant of ADP and MPC are discussed. Second, MPC stability results are transferred to ADP and state and input constraints are considered. The work is concluded by a case study.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Constrained and Stabilizing Stacked Adaptive Dynamic Programming and a Comparison with Model Predictive Control

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Learning-Based Predictive Control for Discrete-Time Nonlinear Systems With Stochastic Disturbances.
Xin Xu ... Dazi Li
IEEE Transactions on Neural Networks and Learning Systems | VOL. 29
Xin Xu, et. al.Xin Xu ... Dazi Li
09 May 2018
IEEE Transactions on Neural Networks and Learning Systems | VOL. 29

Editorial Special Issue on Adaptive Dynamic Programming and Reinforcement Learning
Derong Liu ... Qinglai Wei
IEEE Transactions on Systems, Man, and Cybernetics: Systems | VOL. 50
Derong Liu, et. al.Derong Liu ... Qinglai Wei
26 Oct 2020
IEEE Transactions on Systems, Man, and Cybernetics: Systems | VOL. 50

Data‐based learning control for optimization of nonlinear systems
Qinglai Wei ... Pinjia Zhang
Optimal Control Applications and Methods | VOL. 44
Qinglai Wei, et. al.Qinglai Wei ... Pinjia Zhang
09 Mar 2023
Optimal Control Applications and Methods | VOL. 44

Adaptive Dynamic Programming for Multi-intersections Traffic Signal Intelligent Control
Tao Li ... Jianqiang Yi
-
Tao Li, et. al.Tao Li ... Jianqiang Yi
01 Oct 2008
01 Oct 2008

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Constrained and Stabilizing Stacked Adaptive Dynamic Programming and a Comparison with Model Predictive Control

Abstract

Talk to us

Similar Papers