Abstract

Planetary soft landing has been studied extensively due to its promising application prospects. In this paper, a soft landing control algorithm based on deep reinforcement learning (DRL) with good convergence properties is proposed. First, the soft landing problem of the powered descent phase is formulated and the theoretical basis of reinforcement learning (RL) used in this paper is introduced. Second, to ease convergence, the reward function is designed to include process rewards such as a velocity-tracking reward, which mitigates the sparse-reward problem. By further including a fuel-consumption penalty and a constraint-violation penalty, the lander learns to track the reference velocity while saving fuel and keeping its attitude angles within safe ranges. Training simulations are then carried out under three classical RL frameworks, Deep Deterministic Policy Gradient (DDPG), Twin Delayed DDPG (TD3), and Soft Actor-Critic (SAC), all of which converge. Finally, the trained policy is deployed in velocity-tracking and soft landing experiments, whose results demonstrate the validity of the proposed algorithm.
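The paper does not publish code, but as a purely illustrative sketch, the snippet below shows how a policy could be trained under the three frameworks named in the abstract using the Stable-Baselines3 library. The `LanderEnv` factory is a hypothetical placeholder for the powered-descent environment formulated in the paper, not something the authors provide.

```python
# Hypothetical sketch: training one policy per framework named in the
# abstract, via the Stable-Baselines3 library. The environment returned
# by `make_env` is a placeholder for the paper's powered-descent
# environment (it must implement the gymnasium.Env interface).
from stable_baselines3 import DDPG, TD3, SAC

def train_all(make_env, total_timesteps=200_000):
    """Train a policy with each algorithm and return the trained models."""
    models = {}
    for name, algo in [("ddpg", DDPG), ("td3", TD3), ("sac", SAC)]:
        env = make_env()                      # fresh environment per run
        model = algo("MlpPolicy", env, verbose=0)
        model.learn(total_timesteps=total_timesteps)
        model.save(f"{name}_lander")          # reload later for deployment
        models[name] = model
    return models

# Usage (LanderEnv is hypothetical):
# models = train_all(lambda: LanderEnv())
```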

Highlights

  • Planetary soft landing has been studied extensively due to its promising application prospects

  • Based on the dynamic model established above, we design an algorithm based on reinforcement learning (RL) tailored to the characteristics of the soft landing problem, including the selection of observations, the design of the reward function, and other settings concerning how the agent interacts with the environment

  • A process reward is introduced during the landing: a reference velocity is prescribed according to the real-time relative position between the lander and the target landing area (a minimal sketch of such a reward follows this list)
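The paper's actual reward function is not reproduced here; the following is a minimal sketch, assuming a dense shaping term that rewards tracking a position-dependent reference velocity, plus the fuel-consumption and attitude-constraint penalties mentioned in the abstract. The reference-velocity profile and all coefficients (k_ref, k_v, k_f, k_att, att_limit) are illustrative assumptions, not the authors' values.

```python
# Hypothetical reward sketch for the powered-descent agent.
# Profile shape and all coefficients are assumptions for illustration.
import numpy as np

def reference_velocity(rel_pos, k_ref=0.3, v_max=100.0):
    """Reference velocity pointing from the lander toward the target,
    with magnitude shrinking as the lander approaches (assumed profile)."""
    dist = np.linalg.norm(rel_pos)
    speed = min(v_max, k_ref * dist)           # slow down near the target
    return -speed * rel_pos / max(dist, 1e-6)

def step_reward(rel_pos, velocity, thrust, attitude,
                att_limit=np.deg2rad(20), k_v=1.0, k_f=1e-3, k_att=10.0):
    """Dense (process) reward: velocity tracking minus fuel-consumption
    and attitude-constraint-violation penalties."""
    v_ref = reference_velocity(rel_pos)
    track = -k_v * np.linalg.norm(velocity - v_ref)      # velocity tracking
    fuel = -k_f * np.linalg.norm(thrust)                 # fuel penalty
    viol = -k_att * np.sum(np.abs(attitude) > att_limit) # attitude violation
    return track + fuel + viol
```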

Summary

Soft Landing Problem Formulation

The planetary-surface-fixed frame of reference is defined as in Figure 1. Since the powered descent begins at an altitude that is low compared with the planet's radius, and the distance between the lander and the target landing site varies little during this phase, it is appropriate to assume the planet's gravity is a constant g. When the powered descent phase begins, the lander has already released its parachute and its speed is on the order of 100 m/s [22]. As fuel is burned, the lander's mass decreases at the rate ṁ = -‖T‖/(Isp·g0), where Isp is the specific impulse of the engine, ‖T‖ is the thrust magnitude, and g0 is the standard gravitational acceleration; the inertia matrix decreases accordingly as the mass decreases. The shape of the lander is a cuboid with sides of length a × b × c and uniform mass distribution.
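To make the mass and inertia bookkeeping concrete, the sketch below computes the standard closed-form inertia matrix of a uniform cuboid and a single Euler step of the mass-depletion equation above; the engine parameters, time step, and lander dimensions are placeholder values, not taken from the paper.

```python
# Minimal sketch of the mass/inertia bookkeeping described above.
# Engine parameters, time step, and dimensions are placeholders.
import numpy as np

G0 = 9.80665  # standard gravitational acceleration [m/s^2]

def cuboid_inertia(m, a, b, c):
    """Inertia matrix of a uniform cuboid of mass m and sides a x b x c,
    about its center of mass (standard closed-form result)."""
    return (m / 12.0) * np.diag([b**2 + c**2, a**2 + c**2, a**2 + b**2])

def mass_after_step(m, thrust_mag, isp, dt):
    """One Euler step of mdot = -|T| / (Isp * g0)."""
    return m - thrust_mag / (isp * G0) * dt

# Example: as fuel burns, both the mass and the inertia matrix shrink.
m = 1500.0                                    # lander mass [kg] (assumed)
m = mass_after_step(m, thrust_mag=6000.0, isp=300.0, dt=0.1)
I = cuboid_inertia(m, a=2.0, b=2.0, c=1.5)    # sides [m] (assumed)
```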

RL Basis
Soft Landing with DRL
Reward Setting
Observation Space
Action Space
Network Architecture
Simulation Settings
Simulation Results
Conclusions
Degree of Freedom