A Pre-Trained Fuzzy Reinforcement Learning Method for the Pursuing Satellite in a One-to-One Game in Space.

Xiao Wang,Yushan Zhao,Peng Shi,Yue Sun

doi:10.3390/s20082253

Abstract

In order to help the pursuer find its advantaged control policy in a one-to-one game in space, this paper proposes an innovative pre-trained fuzzy reinforcement learning algorithm, which is conducted in the x, y, and z channels separately. Compared with the previous algorithms applied in ground games, this is the first time reinforcement learning has been introduced to help the pursuer in space optimize its control policy. The known part of the environment is utilized to help the pursuer pre-train its consequent set before learning. An actor-critic framework is built in each moving channel of the pursuer. The consequent set of the pursuer is updated through the gradient descent method in fuzzy inference systems. The numerical experimental results validate the effectiveness of the proposed algorithm in improving the game ability of the pursuer.

Highlights

Tracking space targets is beneficial for orbital garbage removal, recovery of important components, and early warning of space threats [1]
The differential game in space has more complex dynamics; it will be extremely hard to solve without any prior information
In this paper, we propose an innovative pre-trained fuzzy reinforcement learning (PTFRL) algorithm to help the pursuer optimize its control policy through a pre-training process

Summary

Introduction

Tracking space targets is beneficial for orbital garbage removal, recovery of important components, and early warning of space threats [1]. The differential game in space has more complex dynamics; it will be extremely hard to solve without any prior information To overcome this shortcoming, in this paper, we propose an innovative pre-trained fuzzy reinforcement learning (PTFRL) algorithm to help the pursuer optimize its control policy through a pre-training process. This paper applies a pre-trained fuzzy reinforcement learning algorithm to optimize the control policy of a pursuer, which is used for a one-to-one game in outer space. The main improvements of this paper are as follows: (1) Unlike the previous control laws, which were designed based on the adaptive control theory, for the first time, we utilize the technique of reinforcement learning to help the pursuer track a moving non-cooperative target in space. The structure of this paper is as follows: Section 2 presents the dynamics of the pursuer and the evader; Section 3 discusses the fuzzy inference system and its combination with reinforcement learning for continuous systems; Section 4 applies the pre-trained fuzzy reinforcement learning algorithm for the pursuer; Section 5 simulates the proposed algorithm; Section 6 discusses the experimental results; Section 7 draws the conclusions

Dynamics of the Space Differential Game

Reinforcement Learning in Continuous Systems

The Fuzzy Inference System

The Fuzzy Actor-Critic Learning Algorithm

Fuzzy Reinforcement Learning Algorithm

Pre-Training Process Based on the Genetic Algorithm

Simulation

Findings

Discussion

Conclusions

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Sensors (Basel, Switzerland)	Publication Date: Apr 16, 2020
Citations: 7	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

A Pre-Trained Fuzzy Reinforcement Learning Method for the Pursuing Satellite in a One-to-One Game in Space.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Sensors (Basel, Switzerland)

Lead the way for us

Similar Papers

Design of Parameter-Self-Tuning Controller Based on Reinforcement Learning for Tracking Noncooperative Targets in Space
Xiao Wang ... Peng Shi
IEEE Transactions on Aerospace and Electronic Systems | VOL. 56
Xiao Wang, et. al.Xiao Wang ... Peng Shi
24 Apr 2020
IEEE Transactions on Aerospace and Electronic Systems | VOL. 56

Self-Learning in Obstacle Avoidance of a Mobile Robot via Dynamic Self-Generated Fuzzy Q-Learning
Yi Zhou ... Meng Er
-
Yi Zhou, et. al.Yi Zhou ... Meng Er
01 Jan 2006
01 Jan 2006

A Bayesian game based adaptive fuzzy controller for multiagent POMDPs
Rajneesh Sharma ... Matthijs T J Spaan
-
Rajneesh Sharma, et. al.Rajneesh Sharma ... Matthijs T J Spaan
01 Jul 2010
01 Jul 2010

A deep actor critic reinforcement learning framework for learning to rank
Vaibhav Padhye ... Kailasam Lakshmanan
Neurocomputing | VOL. 547
Vaibhav Padhye, et. al.Vaibhav Padhye ... Kailasam Lakshmanan
18 May 2023
Neurocomputing | VOL. 547

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Pre-Trained Fuzzy Reinforcement Learning Method for the Pursuing Satellite in a One-to-One Game in Space.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Sensors (Basel, Switzerland)