A Deep Reinforcement Learning Scheme for Sum Rate and Fairness Maximization Among D2D Pairs Underlaying Cellular Network With NOMA

Vineet Vishnoi, Ishan Budhiraja, Neeraj Kumar, Suneet Gupta

doi:10.1109/tvt.2023.3276647

Abstract

Device-to-device (D2D) communication is an emerging technology in 5G and upcoming 6G networks because it can enhance spectral efficiency (SE), energy efficiency (EE), and sum rate. Despite these advantages, co-channel interference, cross-channel interference, and ultra-massive connectivity are major issues that can degrade the performance of any solution implemented in this environment. To address these issues, in this paper we integrate power-domain non-orthogonal multiple access (PD-NOMA) at the base station (BS). NOMA serves more than one user on the same resource block (RB) and, through successive interference cancellation (SIC), reduces the effect of interference at the cellular users (CUs). The problem is formulated as a mixed-integer non-linear programming (MINLP) problem, subject to the resource and power constraints of the BS and the D2D pairs (DDPs), with the aim of maximizing the sum rate and fairness among the NOMA-enabled CUs and DDPs. We first use a centralized deep deterministic policy gradient (DDPG) together with the arithmetic-geometric mean approximation (AGMA) technique to reduce cross-channel interference (CR-CI) and control the power. Then, to provide fairness to all users, we transform the proposed solution into a distributed deep deterministic policy gradient (D3PG). The successive convex approximation technique is then integrated into D3PG to mitigate the effect of co-channel interference (CO-CI) among DDPs. The experimental results show that the proposed scheme achieves superior sum rate and fairness: its sum rate is 21.05%, 34.21%, and 49.8% higher than that of the DDPG, deep dueling, and DQN scheduling schemes, respectively.
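To make the two objectives named in the abstract concrete, the following is a minimal sketch of how per-user rates on a single PD-NOMA resource block and a fairness score could be computed. It is not the authors' formulation. The function names, the channel gains, the power split, and the use of Jain's index as the fairness measure are all illustrative assumptions; the abstract does not state which fairness metric the paper uses, and D2D interference is left out for brevity.

```python
import math


def noma_downlink_rates(gains, power_fractions, total_power, noise_power):
    """Per-user rates (bit/s/Hz) on one downlink PD-NOMA resource block.

    Assumption: ideal SIC. Each user cancels the signals intended for users
    with weaker channels and treats the signals for stronger users as noise.
    """
    # Indices sorted from weakest to strongest channel gain.
    order = sorted(range(len(gains)), key=lambda i: gains[i])
    rates = [0.0] * len(gains)
    for rank, user in enumerate(order):
        # Power allocated to stronger users is not cancelled at this user.
        residual_fraction = sum(power_fractions[j] for j in order[rank + 1:])
        interference = residual_fraction * total_power * gains[user]
        signal = power_fractions[user] * total_power * gains[user]
        rates[user] = math.log2(1.0 + signal / (interference + noise_power))
    return rates


def jain_fairness(rates):
    """Jain's fairness index (assumed metric): 1/n (least fair) to 1 (equal rates)."""
    squares = sum(r * r for r in rates)
    if squares == 0:
        return 0.0
    total = sum(rates)
    return total * total / (len(rates) * squares)


# Hypothetical two-user example: a weak user and a strong user share one RB.
example_rates = noma_downlink_rates(
    gains=[1.0, 10.0],           # assumed channel power gains
    power_fractions=[0.8, 0.2],  # more power to the weaker user
    total_power=1.0,
    noise_power=0.1,
)
sum_rate = sum(example_rates)
fairness = jain_fairness(example_rates)
```

Shifting power toward the stronger user generally raises `sum_rate` while lowering `fairness`, which is the kind of trade-off a power-control policy such as the one described in the abstract has to balance.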

Journal: IEEE Transactions on Vehicular Technology
Publication Date: Oct 1, 2023
Citations: 15

Similar Papers

Interference Mitigation and Secrecy Ensured for NOMA-Based D2D Communications Under Imperfect CSI
Ishan Budhiraja ... Joel J P C Rodrigues
01 Jun 2021

Cross-Layer Interference Management Scheme for D2D Mobile Users Using NOMA
Ishan Budhiraja ... Sudhanshu Tyagi
IEEE Systems Journal | VOL. 15
22 Jun 2020

Two-stage coalition formation and radio resource allocation with Nash bargaining solution for inband underlaid D2D communications in 5G networks
Chih-Cheng Tseng ... Jyun-Yao Shih
Journal of Network and Computer Applications | VOL. 111
20 Mar 2018

Resource Allocation using Power Domain Non-Orthogonal Multiple Access
Gautham Vinod ... Anjana K Menon
01 Jul 2020
