Abstract

Hybrid radio frequency (RF) and visible light communication (VLC) networks can provide high throughput and energy efficiency with VLC access points (APs) while ensuring ubiquitous coverage with RF APs. Due to dynamic channel conditions and limited resources, the hybrid RF/VLC networks’ resource allocation problem is complex and challenging. Conventional resource allocation techniques fail to overcome these challenges. Heuristic methods can solve high complexity problems; however, they are not robust against changes such as dynamic channel conditions or alternating user requirements. Heuristic methods require centralized control for stability which adds communication overhead between APs. Deep Reinforcement Learning (DRL) based solutions can solve high complexity, dynamic channel conditions, and alternating user requirements while not requiring centralized control. In this paper, we formulate a distributed downlink power allocation problem to optimize the transmit power for users to reach target data rates in hybrid RF/VLC networks. Then, we propose a distributed DRL-based algorithm Deep Deterministic Policy Gradient (DDPG), to solve the formulated computationally-intensive problem. We implement a simulation environment to benchmark the proposed distributed DRL-based method against other methods such as Q-Learning (QL) and Deep Q-Networks (DQN), and centralized heuristic power allocation algorithms. Our simulation results show that the distributed DDPG-based algorithm learns to adapt against changes in the channel or user requirements, while centralized Genetic Algorithm and Particle Swarm Optimization-based algorithms fail to endure against these changes even with coordination between APs. Additionally, we quantify the performance of the DDPG-based algorithm to prevail amid DRL-based algorithms at the expense of higher implementation complexity.

Highlights

  • The spectrum scarcity problem is becoming more critical for communication networks as the variety and the quantity of devices increase rapidly

  • There is no paper considering a distributed Deep Deterministic Policy Gradient (DDPG)-based algorithm to solve power allocation for hybrid radio frequency (RF)/Visible Light Communication (VLC) networks that can cope with a large number of users with mobility and dynamic channel conditions

  • We show that the DDPG-based algorithm has the best performance in terms of QoS and convergence performance compared to other Deep Reinforcement Learning (DRL)-based algorithms and heuristic algorithms

Read more

Summary

INTRODUCTION

The spectrum scarcity problem is becoming more critical for communication networks as the variety and the quantity of devices increase rapidly. We design and implement a novel distributed DDPG-based algorithm for the problem of power allocation for hybrid RF/VLC networks to provide multiple users’ QoS requirements regarding downlink data rates. There is no paper considering a distributed DDPG-based algorithm to solve power allocation for hybrid RF/VLC networks that can cope with a large number of users with mobility and dynamic channel conditions. In [1], we had applied DQN-based algorithm to solve power allocation optimization problem with two static users with static QoS requirements. We propose, design, and implement a distributed DDPG-based algorithm, to solve the power allocation optimization problem for a multiuser hybrid RF/VLC network with limited radio resources and dynamic channel conditions.

RELATED WORK
SYSTEM MODEL
PROBLEM FORMULATION
RL-BASED MULTIAGENT POWER ALLOCATION
Complexity Discussion
1: Initialization
22: Update the θRQ in critic network by minimizing the loss:
Simulation Parameters and Stability Discussion
NUMERICAL RESULTS
Performance Evaluation Before Training
Performance Evaluation After Training
CONCLUSION
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call