Abstract

Next generation wireless networks are expected to be extremely complex due to their massive heterogeneity in terms of the types of network architectures they incorporate, the types and numbers of smart IoT devices they serve, and the types of emerging applications they support. In such large-scale and heterogeneous networks (HetNets), radio resource allocation and management (RRAM) becomes one of the major challenges encountered during system design and deployment. In this context, emerging Deep Reinforcement Learning (DRL) techniques are expected to be one of the main enabling technologies to address the RRAM in future wireless HetNets. In this paper, we conduct a systematic in-depth, and comprehensive survey of the applications of DRL techniques in RRAM for next generation wireless networks. Towards this, we first overview the existing traditional RRAM methods and identify their limitations that motivate the use of DRL techniques in RRAM. Then, we provide a comprehensive review of the most widely used DRL algorithms to address RRAM problems, including the value- and policy-based algorithms. The advantages, limitations, and use-cases for each algorithm are provided. We then conduct a comprehensive and in-depth literature review and classify existing related works based on both the radio resources they are addressing and the type of wireless networks they are investigating. To this end, we carefully identify the types of DRL algorithms utilized in each related work, the elements of these algorithms, and the main findings of each related work. Finally, we highlight important open challenges and provide insights into several future research directions in the context of DRL-based RRAM. This survey is intentionally designed to guide and stimulate more research endeavors towards building efficient and fine-grained DRL-based RRAM schemes for future wireless networks.

Highlights

  • R ADIO resource allocation and management (RRAM) is regarded as one of the essential challenges encountered in modern wireless communication networks [1]

  • We observe that various DRL techniques can efficiently solve the power allocation optimization problems in diversified wireless network scenarios, and their performance outperforms the state-of-the-art heuristic approaches

  • Since the power allocation problem falls in the continuous action space, the use of valuebased algorithms to address these types of problems makes the learned policies vulnerable to discretization errors that degrade the accuracy and reliability of the learned models

Read more

Summary

INTRODUCTION

R ADIO resource allocation and management (RRAM) is regarded as one of the essential challenges encountered in modern wireless communication networks [1]. This will ensure ubiquitous connectivity for user devices with enhanced quality of service (QoS) in terms of coverage, reliability, and throughput. It is expected that by 2023, the number of user networked devices and connections, including smartphones, tablets, wearable devices, and sensors, will reach 29.3 billion [6], and generate a data rate exceeding 50 trillion GB [1] All these trends will exacerbate the burdens during system design, planning, deployment, operation, and management.

Motivations of the Paper
Related Work
Paper Contributions
RADIO RESOURCE ALLOCATION AND MANAGEMENT TECHNIQUES
Radio Resources
Conventional RRAM Techniques
Limitation of Conventional RRAM Techniques
Advantages of Using DRL-Based Techniques for RRAM
OVERVIEW OF DRL TECHNIQUES USED FOR RRAM
Value-Based Algorithms
Policy-Based Algorithm
Other DRL Algorithms
Multi-Agent DRL Algorithms
DRL for Power Allocation
Findings and Lessons Learned
DRL for Spectrum Allocation and Access Control
DRL for Rate Control
DRL for Joint RRAM
OPEN CHALLENGES AND FUTURE RESEARCH DIRECTIONS
CONCLUSION
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call