Abstract
This article investigates the Pareto optimality of infinite horizon cooperative linear quadratic (LQ) differential games by policy iteration technique where the system dynamics are partially or completely unknown. Firstly, the policy iteration algorithm for the approximate solutions of the corresponding algebraic Riccati equation (ARE) without any prior knowledge of the matrix parameters of the dynamic system is derived by collecting the input and state information of each player. Secondly, when the presented specific rank condition is satisfied, the convergence of the proposed algorithm is rigorously demonstrated by recursion. Moreover, the weighting approach is employed to obtain the Pareto optimal strategy and the Pareto optimal solutions on the basis of the convex optimization theory. Finally, simulation results are reported to verify the feasibility and correctness of the proposed theoretical results.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.