Abstract
The demand for bandwidth-intensive and delay-sensitive services is surging with the development of 5G technology, resulting in fierce competition for scarce radio resources. Power-domain Nonorthogonal Multiple Access (PD-NOMA) can dramatically improve system capacity and spectrum efficiency. Unlike existing NOMA scheduling work, which mainly focuses on fairness, this paper proposes a power control solution for uplink hybrid OMA and PD-NOMA under dual uncertainty: dynamic, imperfect channel information together with random user-specific hierarchical quality-of-service (QoS) requirements. The power control problem is modeled as a nonconvex stochastic optimization problem that aims to maximize system energy efficiency while guaranteeing the hierarchical QoS requirements of users, and is then formulated as a partially observable Markov decision process (POMDP). Owing to the difficulty of modeling time-varying scenes, the need for fast convergence, the required adaptability to a dynamic environment, and the continuity of the decision variables, a Deep Reinforcement Learning (DRL)-based method is proposed. The paper also transforms the hierarchical QoS constraint under NOMA successive interference cancellation (SIC) into a form suitable for DRL. Simulation results verify the effectiveness and robustness of the proposed algorithm under the dual uncertain environment; compared with the baseline Particle Swarm Optimization (PSO) algorithm, the proposed DRL-based method demonstrates satisfactory performance.
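The abstract does not reproduce the optimization model itself, but a standard uplink PD-NOMA energy-efficiency formulation along the lines it describes would read as follows; all symbols here (powers p_k, channel gains g_k, rate floors R_k^min, circuit power P_c) are chosen for illustration rather than taken from the paper:

```latex
% Achievable uplink rate of user k under SIC, assuming users are
% decoded in the order 1, 2, ..., K, so user k's signal suffers
% interference only from users decoded after it:
R_k = B \log_2\!\left(1 + \frac{p_k g_k}{\sum_{j=k+1}^{K} p_j g_j + \sigma^2}\right)

% Energy-efficiency maximization with hierarchical QoS constraints:
\max_{\{p_k\}} \; \frac{\sum_{k=1}^{K} R_k}{\sum_{k=1}^{K} p_k + P_c}
\quad \text{s.t.} \quad R_k \ge R_k^{\min}, \qquad 0 \le p_k \le P_{\max}
```

Under imperfect channel state information, g_k would be replaced by a noisy estimate, which is what makes the problem stochastic and, together with the random QoS requests, motivates the POMDP formulation.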
Highlights
- Learning to represent the quality of service (QoS) requirement.
- Considering the dual dynamics of the channel and user requirements in a nonconvex optimization problem, this paper proposes a deep deterministic policy gradient (DDPG)-based method.
- Simulation results show that, compared with the global-search Particle Swarm Optimization (PSO) algorithm, the DDPG method is more adaptable to dynamic environments and converges faster.

Introduction

NOMA is expected to remain a key multiple-access technique as networks evolve toward 6G, and its performance depends closely on user pairing, power allocation, and detection-decoding [2].
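Both the abstract and the highlights mention transforming the hierarchical QoS constraint under SIC so that it fits a DRL formulation; a common way to do this is to fold the constraint into the reward as a penalty. The sketch below illustrates that idea in Python; the function names, the descending-received-power decoding order, and the linear penalty are illustrative assumptions, not the paper's actual design.

```python
import numpy as np

def sic_rates(p, g, noise, bandwidth=1.0):
    """Per-user uplink rates under successive interference cancellation,
    assuming users are decoded in descending order of received power
    (an illustrative choice, not necessarily the paper's ordering)."""
    rx = p * g                         # received signal powers
    order = np.argsort(-rx)            # SIC decoding order, strongest first
    rates = np.empty_like(rx)
    interference = rx.sum()
    for k in order:
        interference -= rx[k]          # user k is decoded and cancelled now
        rates[k] = bandwidth * np.log2(1.0 + rx[k] / (interference + noise))
    return rates

def reward(p, g, noise, r_min, p_circuit=1.0, penalty_weight=10.0):
    """Energy efficiency with the hierarchical QoS constraint folded in
    as a linear penalty on per-user rate violations."""
    rates = sic_rates(p, g, noise)
    energy_eff = rates.sum() / (p.sum() + p_circuit)
    violation = np.maximum(r_min - rates, 0.0).sum()
    return energy_eff - penalty_weight * violation
```

With a per-user `r_min` vector, users with stricter QoS tiers simply carry larger rate floors, so the same penalty term expresses the hierarchy.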
Summary
Due to the dual uncertainty in wireless network environments, where both the channel and the QoS requirements change across users and over time, conventional model-based approaches that require complete knowledge of the system and incur high computational complexity can be inefficient or even infeasible in practice. This research therefore proposes an uplink power allocation algorithm for hybrid NOMA that operates under dual uncertainty, i.e., imperfect time-varying channel information and random user-specific hierarchical QoS requirements. Simulation results show that, compared with the global-search PSO algorithm, the proposed DDPG method is more adaptable to dynamic environments and converges faster.
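The summary names DDPG as the learning method but gives no architecture or training details. The following is a minimal DDPG sketch for continuous power control, assuming the observation is the estimated channel gains concatenated with the current QoS levels and the action is the vector of per-user transmit powers; the layer sizes, the sigmoid-scaled actor output, and the omission of the replay buffer, exploration noise, and soft target updates are simplifications, not the paper's configuration.

```python
import torch
import torch.nn as nn

class Actor(nn.Module):
    """Maps an observation (e.g., estimated channel gains + QoS levels)
    to continuous per-user transmit powers in (0, p_max)."""
    def __init__(self, obs_dim, n_users, p_max=1.0):
        super().__init__()
        self.p_max = p_max
        self.net = nn.Sequential(
            nn.Linear(obs_dim, 128), nn.ReLU(),
            nn.Linear(128, 128), nn.ReLU(),
            nn.Linear(128, n_users), nn.Sigmoid(),  # powers scaled to (0, 1)
        )

    def forward(self, obs):
        return self.p_max * self.net(obs)

class Critic(nn.Module):
    """Estimates Q(s, a) for the deterministic policy."""
    def __init__(self, obs_dim, n_users):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(obs_dim + n_users, 128), nn.ReLU(),
            nn.Linear(128, 128), nn.ReLU(),
            nn.Linear(128, 1),
        )

    def forward(self, obs, act):
        return self.net(torch.cat([obs, act], dim=-1))

def ddpg_step(actor, critic, actor_t, critic_t, opt_a, opt_c, batch, gamma=0.99):
    """One DDPG update over a sampled batch. Termination masking is omitted:
    power control is naturally a continuing task."""
    obs, act, rew, next_obs = batch
    with torch.no_grad():
        target_q = rew + gamma * critic_t(next_obs, actor_t(next_obs))
    critic_loss = nn.functional.mse_loss(critic(obs, act), target_q)
    opt_c.zero_grad(); critic_loss.backward(); opt_c.step()

    actor_loss = -critic(obs, actor(obs)).mean()  # deterministic policy gradient
    opt_a.zero_grad(); actor_loss.backward(); opt_a.step()
```

DDPG suits this setting because the transmit powers form a continuous action space; a sigmoid (or tanh) output layer is a common way to keep the actor's actions within the feasible power range.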