Abstract

Unmanned aerial vehicle (UAV) communication is easily wiretapped by malignant nodes due to the broadcast nature of line-of-sight (LoS) wireless channels. To tackle this problem, this paper investigates a dual-UAV aided secure dynamic ground-to-UAV (G2U) communication system. By dynamic, we mean UAVs communicate with moving ground devices (GDs). Our objective is maximizing the sum secrecy rate by the joint optimization of UAV trajectory and GDs transmit power. To achieve it, we first formulate this nonconvex optimization problem as a Constrained Markov Decision Process (CMDP) under the constraints of UAV flying speed, initial and final locations, limited energy, and average transmit power. Then, a Deep Deterministic Policy Gradient (DDPG) based deep reinforcement learning algorithm is designed, named SC-TDPC, to learn the optimal transmit power and UAV trajectory. The experiment results demonstrate that, compared to other benchmark schemes, SC-TDPC can efficiently enhance the UAV communication security in terms of sum secrecy rate.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call