Multi-Agent Deep Reinforcement Learning Based Spectrum Allocation for D2D Underlay Communications

Zheng Li,Caili Guo

doi:10.1109/tvt.2019.2961405

Abstract

Device-to-device (D2D) communication underlay cellular networks is a promising technique to improve spectrum efficiency. In this situation, D2D transmission may cause severe interference to both the cellular and other D2D links, which imposes a great technical challenge to spectrum allocation. Existing centralized schemes require global information, which causes a large signaling overhead. While existing distributed schemes requires frequent information exchange among D2D users and cannot achieve global optimization. In this paper, a distributed spectrum allocation framework based on multi-agent deep reinforcement learning is proposed, named multi-agent actor critic (MAAC). MAAC shares global historical states, actions and policies during centralized training, requires no signal interaction during execution and utilizes cooperation among users to further optimize system performance. Moreover, in order to decrease the computing complexity of the training, we further propose the neighbor-agent actor critic (NAAC) based on the neighbor users’ historical information for centralized training. The simulation results show that the proposed MAAC and NAAC can effectively reduce the outage probability of cellular links, greatly improve the sum rate of D2D links and converge quickly.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Multi-Agent Deep Reinforcement Learning Based Spectrum Allocation for D2D Underlay Communications

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Vehicular Technology

Lead the way for us

Journal: IEEE Transactions on Vehicular Technology	Publication Date: Jan 10, 2020
Citations: 138

Similar Papers

A Multi-Agent Deep Reinforcement Learning Based Spectrum Allocation Framework for D2D Communications
Zheng Li ... Caili Guo
-
Zheng Li, et. al.Zheng Li ... Caili Guo
01 Dec 2019
01 Dec 2019

Outage Probability Analysis for Two-Way Amplify-and-Forward Mobile Relay Assisted Device-to-Device Communication in Rayleigh Fading Channel
Weipeng Wang ... Jihong Zhao
Journal of Physics: Conference Series | VOL. 2218
Weipeng Wang, et. al.Weipeng Wang ... Jihong Zhao
01 Mar 2022
Journal of Physics: Conference Series | VOL. 2218

Cooperative Spectrum Sharing in D2D-Enabled Cellular Networks
Chuan Ma ... Hui Yu
IEEE Transactions on Communications | VOL. 64
Chuan Ma, et. al.Chuan Ma ... Hui Yu
01 Jan 2015
IEEE Transactions on Communications | VOL. 64

Deep Reinforcement Learning Based Resource Allocation for D2D Communications Underlay Cellular Networks
Seoyoung Yu ... Jeong Woo Lee
Sensors | VOL. 22
Seoyoung Yu, et. al.Seoyoung Yu ... Jeong Woo Lee
03 Dec 2022
Sensors | VOL. 22

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Multi-Agent Deep Reinforcement Learning Based Spectrum Allocation for D2D Underlay Communications

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Vehicular Technology