Safe Approximate Dynamic Programming via Kernelized Lipschitz Estimation.

Ankush Chakrabarty,Yebin Wang,Devesh K. Jha,Kyriakos G. Vamvoudakis,Gregery T. Buzzard

doi:10.1109/tnnls.2020.2978805

Safe Approximate Dynamic Programming via Kernelized Lipschitz Estimation.

Ankush Chakrabarty, Yebin Wang + Show 3 more

Open Access

https://doi.org/10.1109/tnnls.2020.2978805

Copy DOI

Journal: IEEE transactions on neural networks and learning systems	Publication Date: Mar 19, 2020
Citations: 62	License type: publisher-specific, author manuscript

Affiliation: Mitsubishi Electric (United States), Georgia Institute of Technology, Purdue University West Lafayette

#Approximate Dynamic Programming #Constraint Enforcement + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

We develop a method for obtaining safe initial policies for reinforcement learning via approximate dynamic programming (ADP) techniques for uncertain systems evolving with discrete-time dynamics. We employ the kernelized Lipschitz estimation to learn multiplier matrices that are used in semidefinite programming frameworks for computing admissible initial control policies with provably high probability. Such admissible controllers enable safe initialization and constraint enforcement while providing exponential stability of the equilibrium of the closed-loop system.

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Similar Papers

Paper Title

Journal

Date

Author

View more papers

More From: IEEE transactions on neural networks and learning systems

Paper Title

Journal

Date

Author

View more papers

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.