Abstract

This article addresses the problem of learning the optimal control policy for a nonlinear stochastic dynamical system. This problem is subject to the “curse of dimensionality” associated with the dynamic programming method. This article proposes a novel decoupled data-based control (D2C) algorithm that addresses this problem using a decoupled, “open-loop–closed-loop” approach. First, an open-loop deterministic trajectory optimization problem is solved using a black-box simulation model of the dynamical system. Then, closed-loop control is developed around this open-loop trajectory by linearizing the dynamics about the nominal trajectory. By virtue of this linearization, a linear quadratic regulator (LQR)-based algorithm can be used for the closed-loop control. We show that the performance of the D2C algorithm is approximately optimal. Moreover, simulation results suggest a significant reduction in training time compared with other state-of-the-art algorithms.
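
Concretely, the two-step structure described above can be illustrated by the following minimal sketch. It assumes a toy double-integrator "black-box" simulator, quadratic cost weights, and a finite-difference gradient-descent open-loop optimizer; these are illustrative assumptions, not the implementation used in the paper.

# Minimal sketch of a D2C-style two-step design (illustrative assumptions:
# double-integrator dynamics, quadratic cost, finite-difference optimizer).
import numpy as np

def step(x, u):
    # Black-box one-step simulation: double integrator, dt = 0.1.
    dt = 0.1
    return np.array([x[0] + dt * x[1], x[1] + dt * u[0]])

T, nx, nu = 30, 2, 1
x0, x_goal = np.array([0.0, 0.0]), np.array([1.0, 0.0])
Q, R, Qf = np.eye(nx), 0.1 * np.eye(nu), 100.0 * np.eye(nx)

def rollout_cost(U):
    # Total quadratic cost of an open-loop control sequence.
    x, cost = x0.copy(), 0.0
    for t in range(T):
        cost += (x - x_goal) @ Q @ (x - x_goal) + U[t] @ R @ U[t]
        x = step(x, U[t])
    return cost + (x - x_goal) @ Qf @ (x - x_goal)

# Step 1: open-loop trajectory optimization using only black-box rollouts
# (finite-difference gradient descent on the control sequence).
U = np.zeros((T, nu))
eps, lr = 1e-4, 1e-2
for _ in range(500):
    grad, base = np.zeros_like(U), rollout_cost(U)
    for t in range(T):
        for j in range(nu):
            Up = U.copy(); Up[t, j] += eps
            grad[t, j] = (rollout_cost(Up) - base) / eps
    U -= lr * grad

# Nominal state trajectory induced by the optimized open-loop controls.
X = [x0.copy()]
for t in range(T):
    X.append(step(X[t], U[t]))

# Step 2: linearize the dynamics about the nominal trajectory (finite
# differences) and run a time-varying LQR backward pass for feedback gains.
A, B = [], []
for t in range(T):
    At, Bt = np.zeros((nx, nx)), np.zeros((nx, nu))
    for j in range(nx):
        dx = np.zeros(nx); dx[j] = eps
        At[:, j] = (step(X[t] + dx, U[t]) - step(X[t], U[t])) / eps
    for j in range(nu):
        du = np.zeros(nu); du[j] = eps
        Bt[:, j] = (step(X[t], U[t] + du) - step(X[t], U[t])) / eps
    A.append(At); B.append(Bt)

P, K = Qf.copy(), [None] * T
for t in reversed(range(T)):
    K[t] = -np.linalg.solve(R + B[t].T @ P @ B[t], B[t].T @ P @ A[t])
    P = Q + A[t].T @ P @ (A[t] + B[t] @ K[t])

def policy(t, x):
    # Closed-loop D2C-style policy: nominal control plus LQR feedback
    # on the deviation from the nominal trajectory.
    return U[t] + K[t] @ (x - X[t])

The resulting policy tracks the nominal trajectory under disturbances, while the expensive optimization was performed only once, open-loop, which is the source of the decoupling in the approach.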
