Abstract
This paper investigates an adaptive dynamic programming (ADP) optimal tracking control algorithm for multi-UAV systems based on zero-sum game theory, addressing external disturbances during flight. By formulating the bounded L 2 gain problem as a two-person zero-sum game between control strategies and external disturbances, the Hamilton-Jacobi-Isaacs (HJI) equation is constructed to derive the Nash equilibrium solution. To overcome the computational challenges of solving the HJI equation, a three-layer neural network structure comprising evaluation, execution, and disturbance networks is employed, integrated with the ADP algorithm to approximate the value function and optimize the control strategy iteratively. Comprehensive simulations demonstrate the proposed method's superior trajectory tracking performance and robustness compared to Sliding Mode Control (SMC). The results confirm the effectiveness of the ADP-based approach in achieving real-time, adaptive control in complex and dynamic multi-UAV environments.
Published Version
Join us for a 30 min session where you can share your feedback and ask us any queries you have