Abstract

Most of the existing anti-jamming decision-making methods overly rely on the subjective experience of radar operators. However, due to the rapid development of cognitive radar and modern electronic warfare, conventional anti-jamming decision-making methods can no longer adapt to the complex and changing electromagnetic environment. The advent of deep reinforcement learning (DRL) provides a new attractive solution for this issue. In this paper, an adversarial anti-jamming decision-making network for cognitive radar via multi-agent deep reinforcement learning (MDRL) is proposed, which has good self-learning ability and can meet the requirements of intelligent, dynamic and real-time in modern electronic warfare. Since competitive decision-makers are considered and these two confrontational sides are not able to obtain the completely accurate information of each other, the environment model is specifically constructed as a partially observable Markov decision process (POMDP). Then, a decision-making network is designed based on deterministic deep deterministic policy gradient (DDPG) algorithm to explore the competition between cognitive radar and smart jammer. In order to overcome the environment non-stationarity, the decision-making network is trained and tested in a special MDRL framework. The experimental results demonstrate that the proposed method is effective in anti-jamming decision-making system of cognitive radar. Furthermore, the two confrontational sides show high decision-making ability and perform well in the adversarial scenario by comparing with other training policies, which demonstrate that confrontational training with powerful opponents can improve the intelligence level of all agents.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call