Research on Scheme Design and Decision of Multiple Unmanned Aerial Vehicle Cooperation Anti-Submarine Based on Knowledge-Driven Soft Actor-Critic

Xiaoyong Zhang,Wei Yue,Wenbin Tang

doi:10.3390/app132011527

Xiaoyong Zhang, Wei Yue + Show 1 more

Open Access

https://doi.org/10.3390/app132011527

Copy DOI

Journal: Applied Sciences	Publication Date: Oct 20, 2023
Citations: 1	License type: CC BY 4.0

Affiliation: Dalian Maritime University

Abstract

To enhance the anti-submarine and search capabilities of multiple Unmanned Aerial Vehicle (UAV) groups in complex marine environments, this paper proposes a flexible action-evaluation algorithm known as Knowledge-Driven Soft Actor-Critic (KD-SAC), which can effectively interact with real-time environmental information. KD-SAC is a reinforcement learning algorithm that consists of two main components: UAV Group Search Knowledge Base (UGSKB) and path planning strategy. Firstly, based on the UGSKB, we establish a cooperation search framework that comprises three layers of information models: the data layer provides prior information and fundamental search rules to the system, the knowledge layer enriches search rules and database in continuous searching processes, and the decision layer utilizes above two layers of information models to enable autonomous decision-making by UAVs. Secondly, we propose a rule-based deductive inference return visit (RDIRV) strategy to enhance the knowledge base of search. The core concept of this strategy is to enable UAVs to learn from both successful and unsuccessful experiences, thereby enriching the search rules based on optimal decisions as exemplary cases. This approach can significantly enhance the learning performance of KD-SAC. The subsequent step involves designing an event-based UGSKB calling mechanism at the decision-making level, which calls a template based on the target and current motion. Finally, it uses a punishment function, and is then employed to achieve optimal decision-making for UAV actions and states. The feasibility and superiority of our proposed algorithm are demonstrated through experimental comparisons with alternative methods. The final results demonstrate that the proposed method achieves a success rate of 73.63% in multi-UAV flight path planning within complex environments, surpassing the other three algorithms by 17.27%, 29.88%, and 33.51%, respectively. In addition, the KD-SAC algorithm outperforms the other three algorithms in terms of synergy and average search reward.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Research on Scheme Design and Decision of Multiple Unmanned Aerial Vehicle Cooperation Anti-Submarine Based on Knowledge-Driven Soft Actor-Critic

Abstract

Talk to us

Similar Papers

More From: Applied Sciences

Lead the way for us

Similar Papers

Using On-line Simulation for Adaptive Path Planning of UAVs
Farzad Kamrani ... Rassul Ayani
-
Farzad Kamrani, et. al.Farzad Kamrani ... Rassul Ayani
01 Oct 2007
01 Oct 2007

Travelling salesman problem for UAV path planning with two parallel optimization algorithms
Jie Chen ... Fang Ye
-
Jie Chen, et. al.Jie Chen ... Fang Ye
01 Nov 2017
01 Nov 2017

Quality-aware UAV coverage and path planning in geometrically complex environments
Rui Dai ... Manish Kumar
Ad Hoc Networks | VOL. 73
Rui Dai, et. al.Rui Dai ... Manish Kumar
28 Feb 2018
Ad Hoc Networks | VOL. 73

Survey on computational-intelligence-based UAV path planning
Yijing Zhao ... Yang Liu
Knowledge-Based Systems | VOL. 158
Yijing Zhao, et. al.Yijing Zhao ... Yang Liu
13 Jun 2018
Knowledge-Based Systems | VOL. 158

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Research on Scheme Design and Decision of Multiple Unmanned Aerial Vehicle Cooperation Anti-Submarine Based on Knowledge-Driven Soft Actor-Critic

Abstract

Talk to us

Similar Papers

More From: Applied Sciences