Two-Stage Pursuit Strategy for Incomplete-Information Impulsive Space Pursuit-Evasion Mission Using Reinforcement Learning

Bin Yang,Pengxuan Liu,Shuang Li,Jinglang Feng

doi:10.3390/aerospace8100299

Abstract

This paper presents a novel and robust two-stage pursuit strategy for the incomplete-information impulsive space pursuit-evasion missions considering the J2 perturbation. The strategy firstly models the impulsive pursuit-evasion game problem into a far-distance rendezvous stage and a close-distance game stage according to the perception range of the evader. For the far-distance rendezvous stage, it is transformed into a rendezvous trajectory optimization problem and a new objective function is proposed to obtain the pursuit trajectory with the optimal terminal pursuit capability. For the close-distance game stage, a closed-loop pursuit approach is proposed using one of the reinforcement learning algorithms, i.e., the deep deterministic policy gradient algorithm, to solve and update the pursuit trajectory for the incomplete-information impulsive pursuit-evasion missions. The feasibility of this novel strategy and its robustness to different initial states of the pursuer and evader and to the evasion strategies are demonstrated for the sun-synchronous orbit pursuit-evasion game scenarios. The results of the Monte Carlo tests show that the successful pursuit ratio of the proposed method is over 91% for all the given scenarios.

Highlights

The space pursuit-evasion (PE) game is a typical zero-sum game [1,2], where the goals of both confrontation sides are completely opposite and irreconcilable
Wang [27] developed the improved branching deep Q networks and the fuzzy actor-critic learning algorithm, respectively. These previous researches usually restricted the initial distance between the two spacecraft to reduce the PE game duration and used a simplified dynamical model to improve the computational efficiency. To remove this limitation and Aerospace 2021, 8, 299 consider realistic space PE game problems, in this paper, a novel two-stage pursuit strategy is developed to find a robust solution for incomplete-information impulsive space pursuitevasion missions considering J2 perturbation
The required velocity increment for the close-distance PE game is the minimum if the evader does not perform any evasive maneuvers, which is equal to the sum of the Δv that were planned in far-distance rendezvous stage (FRS) but not executed increment is large

Summary

Introduction

The space pursuit-evasion (PE) game is a typical zero-sum game [1,2], where the goals of both confrontation sides are completely opposite and irreconcilable. It is challenging to develop a feedback closed-loop control method with high efficiency for the impulsive space PE game missions considering the perturbations of the dynamics. Wang [27] developed the improved branching deep Q networks and the fuzzy actor-critic learning algorithm, respectively These previous researches usually restricted the initial distance between the two spacecraft to reduce the PE game duration and used a simplified dynamical model to improve the computational efficiency. To remove this limitation and Aerospace 2021, 8, 299 consider realistic space PE game problems, in this paper, a novel two-stage pursuit strategy is developed to find a robust solution for incomplete-information impulsive space pursuitevasion missions considering J2 perturbation. The proposed method is applied to the scenarios of spacecraft games in the sun-synchronous orbit, which demonstrates outstanding advantages in robustness to various initial states of the pursuer and the evader and to the different evasion strategies

Dynamical Model with J2 Perturbation

Formulation of Non-Cooperation Target Pursuit Problem

Two-Stage Pursuit Strategy Using Reinforcement Learning

Multi-Impulse Pursuit Trajectory Optimization for FRS

DDPG-Based Pursuit Method for CGS

Deep Deterministic Policy Gradient Algorithm

Simulations and Analysis

Far-Distance

The perThe pursuit pursuittrajectory trajectoryofofthe theFRS

Close-Distance

Close-Distance Pursuit-Evasion Game

Pi wEi

Monte Carlo Analysis

Findings

Conclusions

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Aerospace	Publication Date: Oct 14, 2021
Citations: 15	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Two-Stage Pursuit Strategy for Incomplete-Information Impulsive Space Pursuit-Evasion Mission Using Reinforcement Learning

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Aerospace

Lead the way for us

Similar Papers

Deep Deterministic Policy Gradient Algorithm Based on Convolutional Block Attention for Autonomous Driving
Yanliang Jin ... Leiji Zhu
Symmetry | VOL. 13
Yanliang Jin, et. al.Yanliang Jin ... Leiji Zhu
12 Jun 2021
Symmetry | VOL. 13

UAV maneuvering decision -making algorithm based on Twin Delayed Deep Deterministic Policy Gradient Algorithm
Shuangxia Bai ... Evgeny Neretin
Journal of Artificial Intelligence and Technology | VOL. -
Shuangxia Bai, et. al.Shuangxia Bai ... Evgeny Neretin
07 Dec 2021
Journal of Artificial Intelligence and Technology | VOL. -

Energy Management for Hybrid Energy Storage System in Electric Based on Deep Deterministic Policy Gradient
Shuai Xia ... Chun Wang
International Journal of Computer Science and Information Technology | VOL. 2
Shuai Xia, et. al.Shuai Xia ... Chun Wang
22 Mar 2024
International Journal of Computer Science and Information Technology | VOL. 2

An Improved DDPG Algorithm with Barrier Function for Lane-Change Decision-Making of Intelligent Vehicles
Tianshuo Feng ... Xiaochuan Zhang
-
Tianshuo Feng, et. al.Tianshuo Feng ... Xiaochuan Zhang
01 Jan 2020
01 Jan 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Two-Stage Pursuit Strategy for Incomplete-Information Impulsive Space Pursuit-Evasion Mission Using Reinforcement Learning

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Aerospace