Hybrid NOMA/OMA-Based Dynamic Power Allocation Scheme Using Deep Reinforcement Learning in 5G Networks

Hoang Thi Huong Giang,Insoo Koo,Pham Duy Thanh,Tran Nhut Khai Hoan

doi:10.3390/app10124236

Hoang Thi Huong Giang, Insoo Koo + Show 2 more

Open Access

https://doi.org/10.3390/app10124236

Copy DOI

Journal: Applied Sciences	Publication Date: Jun 20, 2020
Citations: 18	License type: CC BY 4.0

Affiliation: University of Ulsan, Can Tho University

Abstract

Non-orthogonal multiple access (NOMA) is considered a potential technique in fifth-generation (5G). Nevertheless, it is relatively complex when applying NOMA to a massive access scenario. Thus, in this paper, a hybrid NOMA/OMA scheme is considered for uplink wireless transmission systems where multiple cognitive users (CUs) can simultaneously transmit their data to a cognitive base station (CBS). We adopt a user-pairing algorithm in which the CUs are grouped into multiple pairs, and each group is assigned to an orthogonal sub-channel such that each user in a pair applies NOMA to transmit data to the CBS without causing interference with other groups. Subsequently, the signal transmitted by the CUs of each NOMA group can be independently retrieved by using successive interference cancellation (SIC). The CUs are assumed to harvest solar energy to maintain operations. Moreover, joint power and bandwidth allocation is taken into account at the CBS to optimize energy and spectrum efficiency in order to obtain the maximum long-term data rate for the system. To this end, we propose a deep actor-critic reinforcement learning (DACRL) algorithm to respectively model the policy function and value function for the actor and critic of the agent (i.e., the CBS), in which the actor can learn about system dynamics by interacting with the environment. Meanwhile, the critic can evaluate the action taken such that the CBS can optimally assign power and bandwidth to the CUs when the training phase finishes. Numerical results validate the superior performance of the proposed scheme, compared with other conventional schemes.

Highlights

Fourth-generation (4G) systems reached maturity, and will evolve into fifth-generation (5G) systems where limited amounts of new spectrum can be utilized to meet the stringent demands of users
We study a model of a hybrid non-orthogonal multiple access (NOMA)/orthogonal multiple access (OMA) uplink cognitive radio network adopting energy harvesting at the cognitive users (CUs), where solar energy-powered CUs opportunistically use the licensed channel of the primary network to transmit data to a cognitive base station using NOMA/OMA
We investigate the performance of uplink NOMA systems using our proposed scheme

Summary

Introduction

Fourth-generation (4G) systems reached maturity, and will evolve into fifth-generation (5G) systems where limited amounts of new spectrum can be utilized to meet the stringent demands of users. Studied solutions for user pairing, and investigated the power allocation problem by using NOMA in cognitive radio (CR) networks. The user-pairing approach was studied in many predefined power allocation schemes in NOMA communication systems [24] in which internet of things (IoT) devices first harvest energy from. Zhang et al [33] proposed an asynchronous advantage, deep actor-critic-based scheme to optimize spectrum sharing efficiency and guarantee the QoS requirements of PUs and CUs. To the best of our knowledge, there has been little research into resource allocation using deep reinforcement learning under a non-RF energy-harvesting scenario in uplink cognitive radio networks. We propose a deep actor-critic reinforcement learning framework for efficient joint power and bandwidth allocation by adopting hybrid NOMA/OMA in uplink cognitive radio networks (CRNs).

System Model

Long-Term Transmission Rate Maximization Problem Formulation

Deep Reinforcement Learning-Based Resource Allocation Policy

Markov Decision Process

Silent Mode

Transmission Mode

Deep Actor-Critic Reinforcement Learning Algorithm

The Critic with a DNN

The Actor with a DNN

Simulation Results

Conclusions

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Hybrid NOMA/OMA-Based Dynamic Power Allocation Scheme Using Deep Reinforcement Learning in 5G Networks

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Applied Sciences

Lead the way for us

Similar Papers

Security-reliability trade-off for cognitive radio networks in the presence of eavesdropping attack
Jia Zhu
EURASIP Journal on Advances in Signal Processing | VOL. 2013
Jia ZhuJia Zhu
06 Nov 2013
EURASIP Journal on Advances in Signal Processing | VOL. 2013

Uplink NOMA-based long-term throughput maximization scheme for cognitive radio networks: an actor–critic reinforcement learning approach
Hoang Thi Huong Giang ... Insoo Koo
Wireless Networks | VOL. 27
Hoang Thi Huong Giang, et. al.Hoang Thi Huong Giang ... Insoo Koo
02 Jan 2020
Wireless Networks | VOL. 27

Joint uplink user scheduling and power allocation in cognitive MIMO system
Kai Zhu ... Kai Niu
-
Kai Zhu, et. al. Kai Zhu ... Kai Niu
01 Jan 2010
01 Jan 2010

Improving 6G Network Spectrum Efficiency with Non-Cooperative and Cooperative Spectrum Sharing Using NOMA and Massive-MIMO
Mohamed Hassan ... Imadeldin Elsayed
ICST Transactions on Mobile Communications and Applications | VOL. 8
Mohamed Hassan, et. al.Mohamed Hassan ... Imadeldin Elsayed
23 Oct 2023
ICST Transactions on Mobile Communications and Applications | VOL. 8

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Hybrid NOMA/OMA-Based Dynamic Power Allocation Scheme Using Deep Reinforcement Learning in 5G Networks

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Applied Sciences