Potential-Based Least-Squares Policy Iteration for a Parameterized Feedback Control System

Kang Cheng,Kanjian Zhang,Haikun Wei,Shumin Fei

doi:10.1007/s10957-015-0809-6

Potential-Based Least-Squares Policy Iteration for a Parameterized Feedback Control System

Kang Cheng, Kanjian Zhang + Show 2 more

https://doi.org/10.1007/s10957-015-0809-6

Copy DOI

Export

Save

Cite

Journal: Journal of Optimization Theory and Applications

Publication Date: Dec 29, 2015

Affiliation: Southeast University

#Temporal Difference Learning Method #Average Cost Criterion #Policy Iteration Algorithm #Parameterized Control Law #Optimal Control Parameters #Policy Iteration #Control Of Dynamic System #Potential Function #Potential-based Method #Linear Quadratic Problem

Abstract
Full-Text
Similar Papers

Abstract

Listen

In the paper, a potential-based policy iteration method is proposed for optimal control of a stochastic dynamic system with an average cost criterion and a parameterized control law. In this method, the potential function and the optimal control parameters are obtained via a least-squares-based approach. The potential estimation algorithm is derived from a temporal difference learning method, which can be viewed as a continuous version of the least-squares policy evaluation algorithm. The policy iteration algorithm is validated by solving a linear quadratic gaussian problem in the simulation.

Full Text

Published Version

Check institute access

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Similar Papers

Paper Title

Journal

Date

Author

View more papers

More From: Journal of Optimization Theory and Applications

Paper Title

Journal

Date

Author

View more papers

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.

R Discovery Prime

Potential-Based Least-Squares Policy Iteration for a Parameterized Feedback Control System