Abstract

Unmanned aerial vehicles (UAVs) can be utilized as aerial base stations (BSs) to provide auxiliary communication services. In this letter, we propose a deep reinforcement learning (DRL)-based dynamic deployment method for multi-UAV communications. The phasic policy gradient (PPG) is designed to improve the sample efficiency and the attention of the multi-UAV deployment. Simulation results are provided to verify the effectiveness of the proposed method.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call