Abstract

An autonomous optimal trajectory planning method based on the deep deterministic policy gradient (DDPG) algorithm of reinforcement learning (RL) for hypersonic vehicles (HV) is proposed in this paper. First, the trajectory planning problem is converted into a Markov Decision Process (MDP), and the amplitude of the bank angle is designated as the control input. The reward function of the MDP is designed to minimize the terminal position errors of the trajectory while satisfying hard constraints. Deep neural networks (DNNs) are used to approximate the policy function and the action-value function in the DDPG framework. The actor network then computes the control input directly from the flight states. Using a limited exploration strategy, the policy network is considered fully trained when the reward converges to its maximum. Simulation results show that the policy network trained with the DDPG algorithm accomplishes three-dimensional (3D) trajectory planning during the HV glide phase with high terminal precision and stable convergence. Additionally, the single-step computation time of the policy network is near real time, which suggests great potential as an autonomous online trajectory planner. Monte Carlo experiments demonstrate the strong robustness of the autonomous trajectory planner under aerodynamic disturbances.
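
To make the actor-to-control mapping described above concrete, the following is a minimal illustrative sketch of a DDPG-style actor network that maps glide-phase flight states to a bounded bank-angle command, assuming PyTorch. The state layout, network sizes, and the bank-angle bound are assumptions for illustration only and are not the paper's exact configuration.

```python
# Illustrative sketch only: a DDPG-style actor that maps flight states to a
# bounded bank-angle command. State ordering, hidden sizes, and the bound
# are assumptions, not taken from the paper.
import torch
import torch.nn as nn


class Actor(nn.Module):
    def __init__(self, state_dim: int = 6, hidden_dim: int = 128,
                 max_bank_rad: float = 1.4):
        super().__init__()
        self.max_bank_rad = max_bank_rad  # assumed bound on |bank angle| (rad)
        self.net = nn.Sequential(
            nn.Linear(state_dim, hidden_dim), nn.ReLU(),
            nn.Linear(hidden_dim, hidden_dim), nn.ReLU(),
            nn.Linear(hidden_dim, 1), nn.Tanh(),  # output in (-1, 1)
        )

    def forward(self, state: torch.Tensor) -> torch.Tensor:
        # Scale the tanh output to the admissible bank-angle range.
        return self.max_bank_rad * self.net(state)


if __name__ == "__main__":
    actor = Actor()
    # Hypothetical normalized glide-phase state: altitude, longitude,
    # latitude, velocity, flight-path angle, heading angle.
    state = torch.tensor([[0.8, 0.1, -0.2, 0.6, -0.05, 0.3]])
    bank_cmd = actor(state)
    print(bank_cmd.item())  # bank-angle command in radians
```

In the DDPG framework this actor would be trained jointly with a critic that approximates the action-value function, with the reward shaped to penalize terminal position error subject to the stated hard constraints.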
