A Multi-Agent Deep Reinforcement Learning Based Spectrum Allocation Framework for D2D Communications

Zheng Li, Yidi Xuan, Caili Guo

doi:10.1109/globecom38437.2019.9013763

Abstract

Device-to-device (D2D) communication has been recognized as a promising technique for improving spectrum efficiency. However, D2D transmission as an underlay causes severe interference, which makes spectrum allocation technically challenging. Existing centralized schemes require global information, which can incur heavy signaling overhead, while existing distributed solutions require frequent information exchange between users and cannot achieve global optimization. In this paper, a distributed spectrum allocation framework based on multi-agent deep reinforcement learning is proposed, named Neighbor-Agent Actor Critic (NAAC). NAAC uses neighbor users' historical information for centralized training but is executed in a distributed manner without that information. As a result, it requires no signaling exchange during execution while still exploiting cooperation between users to further optimize system performance. Simulation results show that the proposed framework effectively reduces the outage probability of cellular links, improves the sum rate of D2D links, and converges well.
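
The split between centralized training and distributed execution described in the abstract can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: the class name `NeighborAgentSketch`, the linear actor and critic, the observation size, and the feature layout are all hypothetical, and the learning update is omitted. The only point shown is which inputs each part sees: the actor uses the local observation alone, while the critic additionally receives the neighbors' observations and actions and would be used only during training.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def one_hot(index, size):
    """One-hot encoding of a channel index."""
    return [1.0 if i == index else 0.0 for i in range(size)]


class NeighborAgentSketch:
    """Hypothetical D2D agent: local actor, neighbor-aware critic."""

    def __init__(self, obs_dim, n_channels, n_neighbors, rng):
        self.n_channels = n_channels
        # Actor (assumed linear): local observation -> one score per channel.
        self.actor_w = [
            [rng.uniform(-0.1, 0.1) for _ in range(obs_dim)]
            for _ in range(n_channels)
        ]
        # Critic (assumed linear): sees own (obs, action) plus each
        # neighbor's (obs, action), so its input is larger than the actor's.
        critic_dim = (1 + n_neighbors) * (obs_dim + n_channels)
        self.critic_w = [rng.uniform(-0.1, 0.1) for _ in range(critic_dim)]

    def policy(self, local_obs):
        """Channel-selection probabilities from the local observation only."""
        scores = [
            sum(w * x for w, x in zip(row, local_obs))
            for row in self.actor_w
        ]
        return softmax(scores)

    def act(self, local_obs, rng):
        """Distributed execution: no neighbor information is required."""
        probs = self.policy(local_obs)
        return rng.choices(range(self.n_channels), weights=probs)[0]

    def critic_value(self, local_obs, action, neighbor_obs, neighbor_actions):
        """Training-time value estimate that also uses neighbor history."""
        features = list(local_obs) + one_hot(action, self.n_channels)
        for obs, act in zip(neighbor_obs, neighbor_actions):
            features += list(obs) + one_hot(act, self.n_channels)
        return sum(w * x for w, x in zip(self.critic_w, features))


rng = random.Random(0)
agent = NeighborAgentSketch(obs_dim=3, n_channels=4, n_neighbors=2, rng=rng)
obs = [0.2, 0.5, 0.1]
channel = agent.act(obs, rng)  # execution path: local observation only
q = agent.critic_value(        # training path: neighbor data included
    obs, channel, [[0.1, 0.3, 0.2], [0.4, 0.0, 0.6]], [1, 3]
)
```

In this sketch, dropping the critic after training leaves an agent that selects channels without any exchange with its neighbors, which is the property the abstract attributes to NAAC at execution time.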

Similar Papers

Multi-Agent Deep Reinforcement Learning Based Spectrum Allocation for D2D Underlay Communications
Zheng Li ... Caili Guo
IEEE Transactions on Vehicular Technology | VOL. 69
10 Jan 2020

MADRL Based Uplink Joint Resource Block Allocation and Power Control in Multi-Cell Systems
Yuhan Yang ... Tiejun Lv
01 Mar 2023

Multi-Agent Reinforcement Learning Based Fully Decentralized Dynamic Time Division Configuration for 5G and B5G Network
Xiangyu Chen ... Gang Chuai
Sensors (Basel, Switzerland) | VOL. 22
23 Feb 2022

Multi-Agent Deep Reinforcement Learning for Walkers
Inhee Park
24 Feb 2021
