Air Combat Strategies Generation of CGF Based on MADDPG and Reward Shaping

Weiren Kong,Zhen Yang,Deyun Zhou

doi:10.1109/cvidl51233.2020.000-7

Air Combat Strategies Generation of CGF Based on MADDPG and Reward Shaping

Weiren Kong, Zhen Yang + Show 1 more

https://doi.org/10.1109/cvidl51233.2020.000-7

Copy DOI

Publication Date: Jul 1, 2020

Citations: 7

Affiliation: Northwestern Polytechnical University

#Air Combat #Multi-agent Deep Deterministic Policy Gradient + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

The intelligence of the computer-generated force (CGF) is one of the important problems in air combat simulation. The air combat of CGF is modeled as a two-player zero-sum Markov game. An air combat strategies generation method of CGF is proposed to use a multi-agent deep deterministic policy gradient (MADDPG) algorithm. This paper proposes a potential-based reward shaping method to improve the efficiency of the air combat policy generation algorithm. Finally, the efficiency of the air combat policy generation algorithm and the intelligence level of the resulting policy is verified through simulation experiments. The simulation results show that this method has good convergence and better air combat performance with the strategy obtained by the DDPG algorithm.

Full Text