This paper proposes a new 3D Lyapunov guidance vector field(3D-LGV) avoidance strategy based on reinforcement learning for the satellite evasion and interception problem. Combining it with the interfered fluid dynamical system (IFDS) enables the satellite to evade and smoothly enter orbit according to the state of the intercepting satellite in real time. 3D-LGV provides an initial flow field approaching an elliptical orbit, while IFDS provides a perturbed flow field based on the intercepting satellite position. The combined potential field of the initial flow field and the disturbed flow field is the planned velocity direction of the satellite. As a decision-making layer, the proximal policy optimization (PPO) dynamically adjusts the perturbed flow field in the IFDS to increase the avoidance success rate in different scenarios. The experimental results show that, compared with the particle swarm optimization with rolling horizon control algorithm, the algorithm proposed in this paper has a shorter decision time and a higher avoidance success rate. At the same time, Monte Carlo simulation shows that the evasion success rate of the proposed algorithm reaches 98%.
Read full abstract