Abstract
5G wireless networks are expected to satisfy the diverse delay requirements of various traffic types through network resource scheduling. Existing scheduling methods perform poorly in practice because they rely on unrealistic assumptions, such as access to full channel state information (CSI) or an explicit mathematical expression for network delay. In this paper, we consider the delay-oriented packet scheduling problem in multi-cell 5G downlink networks with multiple users and traffic types (e.g., FTP, VoIP, and video streaming), and formulate it as a partially observable Markov decision process (POMDP). We design a delay-oriented downlink scheduling framework based on deep reinforcement learning (DRL) that autonomously schedules active traffic flows without full channel information. Furthermore, we propose a recurrent proximal policy optimization (RPPO) algorithm that perceives the underlying state and accelerates learning under different time granularities, and we rigorously prove the policy gradient theorem under the POMDP setting. By incorporating future traffic information provided by a proposed spatial-temporal prediction algorithm, RPPO balances load and achieves lower delay in real-time multi-cell multi-user scenarios. Extensive experiments on a realistic 5G simulator demonstrate that our framework significantly outperforms existing approaches, reducing tail delay and average delay by up to 48% and 41.7%, respectively.
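For readers unfamiliar with recurrent PPO, the following is a minimal illustrative sketch of the general idea: an actor-critic whose recurrent layer summarizes the history of partial observations, trained with the PPO clipped objective, as RPPO-style methods do under a POMDP. This is not the authors' implementation; the choice of PyTorch, the GRU cell, and all names, shapes, and hyperparameters (RecurrentActorCritic, hidden size, clip epsilon, loss weights) are assumptions for illustration only.

# Minimal sketch of a recurrent PPO actor-critic for a POMDP scheduler.
# Assumed, not from the paper: PyTorch, GRU recurrence, all names/sizes below.
import torch
import torch.nn as nn

class RecurrentActorCritic(nn.Module):
    def __init__(self, obs_dim, n_actions, hidden=128):
        super().__init__()
        # GRU summarizes the history of partial observations into a belief-like state.
        self.gru = nn.GRU(obs_dim, hidden, batch_first=True)
        self.pi = nn.Linear(hidden, n_actions)   # policy head over scheduling actions
        self.v = nn.Linear(hidden, 1)            # value head

    def forward(self, obs_seq, h=None):
        out, h = self.gru(obs_seq, h)             # out: (B, T, hidden)
        dist = torch.distributions.Categorical(logits=self.pi(out))
        return dist, self.v(out).squeeze(-1), h

def ppo_clip_loss(dist, value, action, old_logp, ret, adv,
                  eps=0.2, c_v=0.5, c_ent=0.01):
    # Standard PPO clipped surrogate plus value and entropy terms.
    logp = dist.log_prob(action)
    ratio = torch.exp(logp - old_logp)
    clipped = torch.clamp(ratio, 1 - eps, 1 + eps) * adv
    policy_loss = -torch.min(ratio * adv, clipped).mean()
    value_loss = (ret - value).pow(2).mean()
    return policy_loss + c_v * value_loss - c_ent * dist.entropy().mean()

# Toy rollout: B sequences of T partial observations -> scheduling decisions.
B, T, obs_dim, n_actions = 4, 16, 10, 5
net = RecurrentActorCritic(obs_dim, n_actions)
obs = torch.randn(B, T, obs_dim)
dist, value, _ = net(obs)
action = dist.sample()
loss = ppo_clip_loss(dist, value, action, dist.log_prob(action).detach(),
                     torch.randn(B, T), torch.randn(B, T))  # dummy returns/advantages
loss.backward()

In practice the returns and advantages would come from rollouts in the 5G simulator rather than the dummy tensors above, and the recurrent hidden state would be carried across scheduling steps within an episode.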