Mixed H2/H∞-Policy Learning Synthesis

Lekan Molu

doi:10.1016/j.ifacol.2023.10.148

Mixed H2/H∞-Policy Learning Synthesis

Lekan Molu

Open Access

https://doi.org/10.1016/j.ifacol.2023.10.148

Copy DOI

Journal: IFAC PapersOnLine

Publication Date: Jan 1, 2023

Affiliation: Microsoft (United States)

#Generalized Algebraic Riccati Equation #Stochastic Differential Calculus + Show 8 more

Abstract
Full-Text
Similar Papers

Abstract

A robustly stabilizing optimal control policy in a model-free mixed H2/H∞-control setting is here put forward for counterbalancing the slow convergence and non-robustness of traditional high-variance policy optimization (and by extension policy gradient) algorithms. Leveraging Itô’s stochastic differential calculus, we iteratively solve the system's continuous-time (closed-loop) generalized algebraic Riccati equation(GARE) whilst updating its admissible controllers in a two-player, zero-sum differential game setting. Our new results are illustrated by learning-enabled control systems which gather previously disseminated results in this field in one holistic data-driven presentation with greater simplification, improvement, and clarity.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Similar Papers

Paper Title

Journal

Date

Author

View more papers

More From: IFAC PapersOnLine

Paper Title

Journal

Date

Author

View more papers

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.