Real-Time Policy Optimization for UAV Swarms Based on Evolution Strategies

Zeyu Chen,Haiying Liu,Guohua Liu

doi:10.3390/drones8110619

Abstract

Multi-agent decision-making faces many challenges such as non-stationarity and sparse rewards, while the complexity and randomness of the real environment further complicate policy development. This paper addresses the high-dimensional policy optimization problems of unmanned aerial vehicle (UAV) swarms. By modeling the problem scenario as a Markov decision process, a real-time policy optimization algorithm based on evolution strategy (ES) pre-training is proposed. This approach combines decision-time planning with background planning to evaluate and integrate different sets of policy parameters in a temporal context. In the experimental phase, the policy network is trained using both ES and REINFORCE algorithms on a constructed simulation platform. Comparative experiments demonstrate the effectiveness of using ES for policy pre-training. Finally, the proposed real-time policy optimization algorithm further improves the performance of the swarm by approximately 10% in simulations, offering a feasible solution for adversarial games between swarms and extending the research scope of evolutionary algorithms.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Real-Time Policy Optimization for UAV Swarms Based on Evolution Strategies

Abstract

Talk to us

Similar Papers

More From: Drones

Lead the way for us

Journal: Drones	Publication Date: Oct 29, 2024
License type: CC BY 4.0

Similar Papers

UAVs as a Tool for Optimizing Boat-Supported Flood Evacuation Operations
Lara G Moussa ... Midhun Mohan
Drones | VOL. 8
Lara G Moussa, et. al.Lara G Moussa ... Midhun Mohan
29 Oct 2024
Drones | VOL. 8

A Review on Deep Learning for UAV Absolute Visual Localization
Andy Couturier ... Moulay A Akhloufi
Drones | VOL. 8
Andy Couturier, et. al.Andy Couturier ... Moulay A Akhloufi
29 Oct 2024
Drones | VOL. 8

Real-Time Policy Optimization for UAV Swarms Based on Evolution Strategies
Zeyu Chen ... Guohua Liu
Drones | VOL. 8
Zeyu Chen, et. al.Zeyu Chen ... Guohua Liu
29 Oct 2024
Drones | VOL. 8

Dynamic Path Planning Method for Unmanned Surface Vessels in Complex Traffic Conditions of Island Reefs Waters
Jing Peng ... Qi Zhao
Drones | VOL. 8
Jing Peng, et. al.Jing Peng ... Qi Zhao
29 Oct 2024
Drones | VOL. 8

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Real-Time Policy Optimization for UAV Swarms Based on Evolution Strategies

Abstract

Talk to us

Similar Papers

More From: Drones