Guidance Design for Escape Flight Vehicle against Multiple Pursuit Flight Vehicles Using the RNN-Based Proximal Policy Optimization Algorithm

Xiao Hu,Hongbo Wang,Min Gong,Tianshu Wang

doi:10.3390/aerospace11050361

Abstract

Guidance commands of flight vehicles can be regarded as a series of data sets having fixed time intervals; thus, guidance design constitutes a typical sequential decision problem and satisfies the basic conditions for using the deep reinforcement learning (DRL) technique. In this paper, we consider the scenario where the escape flight vehicle (EFV) generates guidance commands based on the DRL technique, while the pursuit flight vehicles (PFVs) derive their guidance commands employing the proportional navigation method. For every PFV, the evasion distance is described as the minimum distance between the EFV and the PFV during the escape-and-pursuit process. For the EFV, the objective of the guidance design entails progressively maximizing the residual velocity, which is described as the EFV’s velocity when the last evasion distance is attained, subject to the constraint imposed by the given evasion distance threshold. In the outlined problem, three dimensionalities of uncertainty emerge: (1) the number of PFVs requiring evasion at each time instant; (2) the precise time instant at which each of the evasion distances can be attained; (3) whether each attained evasion distance exceeds the given threshold or not. To solve the challenging problem, we propose an innovative solution that integrates the recurrent neural network (RNN) with the proximal policy optimization (PPO) algorithm, engineered to generate the guidance commands of the EFV. Initially, the model, trained by the RNN-based PPO algorithm, demonstrates effectiveness in evading a single PFV. Subsequently, the aforementioned model is deployed to evade additional PFVs, thereby systematically augmenting the model’s capabilities. Comprehensive simulation outcomes substantiate that the guidance design method based on the proposed RNN-based PPO algorithm is highly effective.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Guidance Design for Escape Flight Vehicle against Multiple Pursuit Flight Vehicles Using the RNN-Based Proximal Policy Optimization Algorithm

Abstract

Talk to us

Similar Papers

More From: Aerospace

Lead the way for us

Journal: Aerospace	Publication Date: Apr 30, 2024
License type: CC BY 4.0

Similar Papers

Study on deep reinforcement learning techniques for building energy consumption forecasting
Tao Liu ... Zhengfei Li
Energy and Buildings | VOL. 208
Tao Liu, et. al.Tao Liu ... Zhengfei Li
03 Dec 2019
Energy and Buildings | VOL. 208

Research on Behavioral Decision at an Unsignalized Roundabout for Automatic Driving Based on Proximal Policy Optimization Algorithm
Jingpeng Gan ... Jiancheng Zhang
Applied Sciences | VOL. 14
Jingpeng Gan, et. al.Jingpeng Gan ... Jiancheng Zhang
29 Mar 2024
Applied Sciences | VOL. 14

End-to-end autonomous driving using the Ape-X algorithm in Carla simulation environment
Maxence Hussonnois ... Jae-Yun Jun
-
Maxence Hussonnois, et. al.Maxence Hussonnois ... Jae-Yun Jun
05 Jul 2022
05 Jul 2022

Application of Deep Reinforcement Learning in Guandan Game
Jiahong Pan ... Zhongtian Zhang
-
Jiahong Pan, et. al.Jiahong Pan ... Zhongtian Zhang
15 Aug 2022
15 Aug 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Guidance Design for Escape Flight Vehicle against Multiple Pursuit Flight Vehicles Using the RNN-Based Proximal Policy Optimization Algorithm

Abstract

Talk to us

Similar Papers

More From: Aerospace