Approximate dynamic programming recurrence relations for a hybrid optimal control problem

W Lu,T A Wettergren,R Fierro,S Ferrari

doi:10.1117/12.919286

Abstract

This paper presents a hybrid approximate dynamic programming (ADP) method for a hybrid dynamic system (HDS) optimal control problem, that occurs in many complex unmanned systems which are implemented via a hybrid architecture, regarding robot modes or the complex environment. The HDS considered in this paper is characterized by a well-known three-layer hybrid framework, which includes a discrete event controller layer, a discrete-continuous interface layer, and a continuous state layer. The hybrid optimal control problem (HOCP) is to nd the optimal discrete event decisions and the optimal continuous controls subject to a deterministic minimization of a scalar function regarding the system state and control over time. Due to the uncertainty of environment and complexity of the HOCP, the cost-to-go cannot be evaluated before the HDS explores the entire system state space; as a result, the optimal control, neither continuous nor discrete, is not available ahead of time. Therefore, ADP is adopted to learn the optimal control while the HDS is exploring the environment, because of the online advantage of ADP method. Furthermore, ADP can break the curses of dimensionality which other optimizing methods, such as dynamic programming (DP) and Markov decision process (MDP), are facing due to the high dimensions of HOCP.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Approximate dynamic programming recurrence relations for a hybrid optimal control problem

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Approximate Dynamic Programming with (min; +) linear function approximation for Markov decision processes
L Chandrashekar ... Shalabh Bhatnagar
-
L Chandrashekar, et. al.L Chandrashekar ... Shalabh Bhatnagar
01 Dec 2014
01 Dec 2014

Adaptive Identifier-Critic-Based Optimal Tracking Control for Nonlinear Systems With Experimental Validation
Jing Na ... Yongfeng Lv
IEEE Transactions on Systems, Man, and Cybernetics: Systems | VOL. 52
Jing Na, et. al.Jing Na ... Yongfeng Lv
02 Jul 2020
IEEE Transactions on Systems, Man, and Cybernetics: Systems | VOL. 52

State Transition Tensors for Continuous-Thrust Control of Three-Body Relative Motion
Jackson Kulik ... Dmitry Savransky
Journal of Guidance, Control, and Dynamics | VOL. 46
Jackson Kulik, et. al.Jackson Kulik ... Dmitry Savransky
09 May 2023
Journal of Guidance, Control, and Dynamics | VOL. 46

Approximate dynamic programming based optimal control applied to an integrated plant with a reactor and a distillation column with recycle
Thidarat Tosukhowong ... Jay H Lee
AIChE Journal | VOL. 55
Thidarat Tosukhowong, et. al.Thidarat Tosukhowong ... Jay H Lee
20 Feb 2009
AIChE Journal | VOL. 55

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Approximate dynamic programming recurrence relations for a hybrid optimal control problem

Abstract

Talk to us

Similar Papers