Abstract

Recently proposed adaptive dynamic programming (ADP) tracking controllers assume that the reference trajectory follows time-invariant exo-system dynamics—an assumption that does not hold for many applications. In order to overcome this limitation, we propose a new Q-function that explicitly incorporates a parametrized approximation of the reference trajectory. This allows learning to track a general class of trajectories by means of ADP. Once our Q-function has been learned, the associated controller handles time-varying reference trajectories without the need for further training and independent of exo-system dynamics. After proposing this general model-free off-policy tracking method, we provide an analysis of the important special case of linear quadratic tracking. An example demonstrates that our new method successfully learns the optimal tracking controller and outperforms existing approaches in terms of tracking error and cost.
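The following sketch is not from the paper; it only illustrates, under assumed names and dimensions, how a Q-function of the kind described in the abstract could look in the linear quadratic tracking special case: the parameters describing the reference trajectory enter the Q-function as an additional argument, so the minimizing input adapts to the current reference without retraining.

```python
import numpy as np

# Hypothetical dimensions: state x_k (n), input u_k (m), and a parameter
# vector p_k describing the local reference trajectory (e.g. polynomial
# coefficients). These names and sizes are illustrative assumptions,
# not the paper's notation.
n, m, n_p = 2, 1, 4

def q_value(H, x, u, p):
    """Quadratic Q-function in the augmented vector z = [x; u; p].

    In the linear quadratic tracking special case such a Q-function can be
    written as z^T H z for a symmetric kernel H learned from data
    (e.g. by least-squares, off-policy ADP).
    """
    z = np.concatenate([x, u, p])
    return z @ H @ z

def greedy_input(H, x, p):
    """Input minimizing the quadratic Q-function for given x and p.

    Partitioning H along [x; u; p] and setting the gradient w.r.t. u to
    zero gives u* = -H_uu^{-1} (H_ux x + H_up p).
    """
    H_ux = H[n:n + m, :n]
    H_uu = H[n:n + m, n:n + m]
    H_up = H[n:n + m, n + m:]
    return -np.linalg.solve(H_uu, H_ux @ x + H_up @ p)

# Example with an arbitrary positive definite kernel as a stand-in for a learned one.
rng = np.random.default_rng(0)
A = rng.standard_normal((n + m + n_p, n + m + n_p))
H = A @ A.T + np.eye(n + m + n_p)
x, p = rng.standard_normal(n), rng.standard_normal(n_p)
u_star = greedy_input(H, x, p)
print(q_value(H, x, u_star, p))
```

The point of this construction is that the kernel H is learned once from data; switching to a different reference trajectory only changes the parameter vector p that is plugged into the same Q-function.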

Highlights

  • Adaptive and iterative learning controllers are a powerful tool in the case of unknown or partially unknown system dynamics[1,2,3,4,5] or in multiagent coordination problems.[6]

  • Recently proposed adaptive dynamic programming (ADP) tracking controllers assume that the reference trajectory follows time-invariant exo-system dynamics—an assumption that does not hold for many applications.

  • In order to validate our proposed parametrized reference ADP (PRADP) tracking method, we show simulation results where the reference trajectory is parametrized by means of cubic polynomials (see the sketch below). We compare the results with an ADP tracking method that assumes that the reference can be described by a time-invariant exo-system f_ref(r_k).
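A minimal sketch of such a cubic-polynomial reference parametrization (the function name r_ref, the coefficient layout, and the waypoint fit are illustrative assumptions, not the authors' implementation):

```python
import numpy as np

def r_ref(p, k, dt=0.01):
    """Evaluate a cubic-polynomial reference at time step k.

    p = [c0, c1, c2, c3] are the coefficients that parametrize the
    reference over the current segment; t = k * dt is the segment time.
    """
    t = k * dt
    return p[0] + p[1] * t + p[2] * t**2 + p[3] * t**3

# Fit the coefficients of a segment from a few desired waypoints (least squares).
t_way = np.array([0.0, 0.1, 0.2, 0.3])
r_way = np.array([0.0, 0.2, 0.5, 0.6])
V = np.vander(t_way, 4, increasing=True)          # columns [1, t, t^2, t^3]
p, *_ = np.linalg.lstsq(V, r_way, rcond=None)

print([r_ref(p, k) for k in range(0, 31, 10)])    # reproduces the waypoints
```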

Introduction

Adaptive and iterative learning controllers are a powerful tool in the case of unknown or partially unknown system dynamics[1,2,3,4,5] or in multiagent coordination problems.[6] For the data-based tuning of optimal controllers, where the objective is to minimize a cost functional, adaptive dynamic programming (ADP), a reinforcement-learning method, has recently gained extensive attention.[7] In ADP, the controller adapts its behavior based on its interaction with an unknown system and the associated cost signals.[8] The aim is to track a desired reference trajectory optimally w.r.t. a given objective function for a system with unknown dynamics, where no explicit system model is used (i.e., the model-free setting is considered). The objective function quantifies the control objectives and typically penalizes the control effort and/or the deviation of the system state from the desired trajectory. Examples that require the tracking of flexible and time-varying trajectories are the longitudinal
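For illustration (the exact notation and weighting matrices are assumptions, not taken from the paper), a typical objective function of this kind is the discounted quadratic tracking cost

```latex
J = \sum_{k=0}^{\infty} \gamma^{k}
    \Big[ (x_k - r_k)^{\top} Q \,(x_k - r_k) + u_k^{\top} R \, u_k \Big],
\qquad Q \succeq 0,\; R \succ 0,\; 0 < \gamma \le 1,
```

where the first term penalizes the deviation of the state x_k from the reference r_k and the second term penalizes the control effort u_k.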
