Abstract

For jammer formations, due to advanced digital signal processing and precise synchronization, suppressing and penetrating netted radar systems face more challenges than ever. In this letter, a deep reinforcement learning-based joint path planning and jamming power allocation optimization (DRL-JPPAO) method is proposed. DRL-JPPAO models the penetration operation by a Markov Decision Process and utilizes the Proximal Policy Optimization algorithm to learn optimal sequential actions, which consist of waypoints and jamming power allocation matrices, to solve it. The learning process is guided by a reward function that depends on the success or failure of penetration and the distance to the target. Empirical results with a newly developed environment show that DRL-JPPAO can learn optimal paths and jamming power allocation strategies jointly, accomplishing the penetration operation effectively and elegantly.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call