Learning-Based End-to-End Path Planning for Lunar Rovers with Safety Constraints.

Xiaoqiang Yu, Zexu Zhang, Ping Wang

doi:10.3390/s21030796

Open Access

https://doi.org/10.3390/s21030796

Abstract

Path planning is an essential technology for lunar rovers to carry out safe and efficient autonomous exploration missions. This paper proposes a learning-based end-to-end path planning algorithm for lunar rovers with safety constraints. First, a training environment integrating real lunar surface terrain data was built in the Gazebo simulation environment, and a lunar rover simulator was created in it to simulate the real lunar surface environment and the lunar rover system. Then, an end-to-end path planning algorithm based on deep reinforcement learning was designed, including the state space, action space, network structure, a reward function that accounts for slip behavior, and a training method based on proximal policy optimization. In addition, to improve generalization across different lunar surface topographies and environments of different scales, a variety of training scenarios were set up and the network model was trained following the idea of curriculum learning. The simulation results show that the proposed planning algorithm successfully achieves end-to-end path planning for the lunar rover, and that the paths it generates offer a higher safety guarantee than those of the classical path planning algorithm.
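The abstract names proximal policy optimization as the training method without giving details. As a rough illustration only, the sketch below computes the standard clipped surrogate objective from the general PPO formulation; the function name, the default clipping value of 0.2, and the plain-list inputs are assumptions made for this example and are not taken from the paper.

```python
from typing import Sequence


def ppo_clipped_objective(
    ratios: Sequence[float],
    advantages: Sequence[float],
    clip_eps: float = 0.2,  # assumed default, not a value from the paper
) -> float:
    """Mean PPO clipped surrogate objective over a batch of samples.

    ratios[i] is the probability ratio pi_new(a|s) / pi_old(a|s) and
    advantages[i] is the advantage estimate for the same sample.
    """
    if len(ratios) != len(advantages) or len(ratios) == 0:
        raise ValueError("ratios and advantages must be non-empty and equal length")

    total = 0.0
    for ratio, advantage in zip(ratios, advantages):
        # Restrict the ratio to [1 - eps, 1 + eps].
        clipped_ratio = min(max(ratio, 1.0 - clip_eps), 1.0 + clip_eps)
        # Take the pessimistic (smaller) of the unclipped and clipped terms.
        total += min(ratio * advantage, clipped_ratio * advantage)
    return total / len(ratios)
```

In a training loop this quantity would be maximized (or its negative minimized) with respect to the policy parameters; the clipping removes the incentive to move the new policy far from the one that collected the data.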

Highlights

As the closest celestial body to the Earth in the universe, the Moon is the main goal of human beings for deep space exploration because of its great location advantage and abundant material resources.
We propose a learning-based end-to-end path planning algorithm with safety constraints, in which a safety reward function considering the sliding behavior of the lunar rover is designed. The sliding rate of the lunar rover is predicted from the slope angle of the terrain on which the rover is located and is used as reward feedback for the current state, improving the safety assurance of the rover's autonomous exploration process.
Aiming at the problem of autonomous exploration path planning for lunar rovers, this paper proposes a learning-based end-to-end path planning algorithm with safety constraints.
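The highlights describe a safety reward in which a slip rate predicted from the terrain slope angle is fed back as part of the reward. The sketch below shows one way such a term could be combined with goal progress. It is not the paper's reward function: the quadratic slope-to-slip mapping, the 30-degree saturation slope, the weights, and all function names are hypothetical placeholders chosen for illustration.

```python
def predict_slip_ratio(slope_deg: float, max_slope_deg: float = 30.0) -> float:
    """Hypothetical monotone mapping from slope angle to a slip ratio in [0, 1].

    The quadratic shape and the 30-degree saturation point are assumptions;
    the paper's actual slip prediction model is not reproduced here.
    """
    fraction = min(abs(slope_deg) / max_slope_deg, 1.0)
    return fraction ** 2


def step_reward(
    prev_goal_dist: float,
    curr_goal_dist: float,
    slope_deg: float,
    progress_weight: float = 1.0,  # assumed weight
    slip_weight: float = 1.0,      # assumed weight
) -> float:
    """Reward for one step: progress toward the goal minus a slip penalty."""
    progress = prev_goal_dist - curr_goal_dist
    slip_penalty = predict_slip_ratio(slope_deg)
    return progress_weight * progress - slip_weight * slip_penalty
```

With a term of this shape, two steps that make the same progress toward the goal receive different rewards depending on the local slope, so a policy is nudged toward flatter terrain where predicted slip is lower.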

Summary

Introduction

As the closest celestial body to the Earth in the universe, the Moon is the main goal of human beings for deep space exploration because of its great location advantage and abundant material resources. As understanding of the Moon gradually deepens, the main lunar exploration goals of the world's major aerospace nations will focus on the development and utilization of lunar resources, the establishment of lunar bases, and the use of the Moon as a gateway to deep space. In the lunar exploration plans of various countries, the lunar rover, as the executive body and an important part of the lunar exploration mission, is the main research object. The autonomous movement and detection capability of lunar rovers will provide a more autonomous and robust detection mode for lunar exploration and greatly improve its efficiency.

Journal: Sensors (Basel, Switzerland)
Publication Date: Jan 25, 2021
Citations: 36
License type: CC BY 4.0
