Abstract
In this paper, a bias-policy iteration method for solving the data-driven optimal control problem of unknown continuous-time linear systems is proposed. Firstly, a model-based bias-policy iteration method is given and its convergence is rigorously proved. Then the data-driven implementation for the proposed method is then introduced without using the information of the system matrices. The relationship between the proposed method and the existing policy iteration method and value iteration method is also analyzed. Compared with the existing policy iteration method, the most significant advantage of the proposed method is that, by adding a bias parameter, the condition of the initial admissible controllers can be further relaxed. Simulation examples verify the effectiveness of the proposed bias-policy iteration method.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.