Multi-Objective Optimization of Energy Saving and Throughput in Heterogeneous Networks Using Deep Reinforcement Learning.

Kyungho Ryu,Wooseong Kim

doi:10.3390/s21237925

Abstract

Wireless networking using GHz or THz spectra has encouraged mobile service providers to deploy small cells to improve link quality and cell capacity using mmWave backhaul links. As green networking for less CO2 emission is mandatory to confront global climate change, we need energy efficient network management for such denser small-cell heterogeneous networks (HetNets) that already suffer from observable power consumption. We establish a dual-objective optimization model that minimizes energy consumption by switching off unused small cells while maximizing user throughput, which is a mixed integer linear problem (MILP). Recently, the deep reinforcement learning (DRL) algorithm has been applied to many NP-hard problems of the wireless networking field, such as radio resource allocation, association and power saving, which can induce a near-optimal solution with fast inference time as an online solution. In this paper, we investigate the feasibility of the DRL algorithm for a dual-objective problem, energy efficient routing and throughput maximization, which has not been explored before. We propose a proximal policy (PPO)-based multi-objective algorithm using the actor-critic model that is realized as an optimistic linear support framework in which the PPO algorithm searches for feasible solutions iteratively. Experimental results show that our algorithm can achieve throughput and energy savings comparable to the CPLEX.

Highlights

Increasing mobile traffic accelerates the deployment of dense small cells operating on the 3 GHz spectrum under legacy macro cells, called a heterogeneous small cell network (HetNet), which offloads congested macro cells and eventually enhances quality of user experience (QoE)
The proximal policy (PPO)-based deep reinforcement learning (DRL) algorithm can suffer from finding Pareto fronts in the multiobjective Markov decision process (MDP) (MOMDP) problem since it just learns a policy with a scalarized single objective which is unclear to evaluate each contribution of different objectives
We implement a PPO-based DRL algorithm based on actor-critic architecture

Summary

Introduction

Increasing mobile traffic accelerates the deployment of dense small cells operating on the 3 GHz spectrum under legacy macro cells, called a heterogeneous small cell network (HetNet), which offloads congested macro cells and eventually enhances quality of user experience (QoE). To the best of our knowledge, this is the first work that investigates DRL to find the Pareto front of a multi-objective optimization problem of energy saving and throughput maximization in the HetNet with an mmWave-based multi-hop backhaul mesh. The PPO algorithm can provide an online policy for controlling backhaul transmission and SeNB power in HetNets, and it is simple to implement but comparable with the complicated trust region policy optimization (TRPO) [23] in terms of performance It is a challenge for the PPO algorithm to find an optimum of the multi-objective problem if only the reward sum of conflicting multi-objectives is given to an agent for training.

Related Works

Deep Q-Learning

Policy Gradient and Actor-Critic

System Model

Energy Consumption Model

AN Energy Consumption

BN Energy Consumption

Switch On and Off Model

Multi-Hop Routing Model

Link Capacity and Scheduling Model

Dual Objective Function

Deep Multi-Objective Reinforcement Learning in mmWave HetNet

Proximal Policy Optimization

PPO-Based DRL for HetNet Optimization

Multi-Objective Deep Reinforcement Learning

1: Initialization

Experiment

Conclusions

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Sensors	Publication Date: Nov 27, 2021
Citations: 17	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Multi-Objective Optimization of Energy Saving and Throughput in Heterogeneous Networks Using Deep Reinforcement Learning.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Sensors

Lead the way for us

Similar Papers

An explainable deep reinforcement learning algorithm for the parameter configuration and adjustment in the consortium blockchain
Zhonghao Zhai ... Yanqin Mao
Engineering Applications of Artificial Intelligence | VOL. 129
Zhonghao Zhai, et. al.Zhonghao Zhai ... Yanqin Mao
30 Nov 2023
Engineering Applications of Artificial Intelligence | VOL. 129

Collision-avoidance under COLREGS for unmanned surface vehicles via deep reinforcement learning
Yong Ma ... Yuanzhou Zheng
Maritime Policy & Management | VOL. 47
Yong Ma, et. al.Yong Ma ... Yuanzhou Zheng
12 May 2020
Maritime Policy & Management | VOL. 47

Medical Equipment Supply Chain Optimization and Stability Study using Deep Reinforcement Learning
Zhuoxun Chen
Highlights in Science, Engineering and Technology | VOL. 68
Zhuoxun ChenZhuoxun Chen
09 Oct 2023
Highlights in Science, Engineering and Technology | VOL. 68

Space Manipulator Assembly Operation Technique based on Deep Residual Reinforcement Learning
Kui Huang ... Binyan Liang
Journal of Physics: Conference Series | VOL. 2405
Kui Huang, et. al.Kui Huang ... Binyan Liang
01 Dec 2022
Journal of Physics: Conference Series | VOL. 2405

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Multi-Objective Optimization of Energy Saving and Throughput in Heterogeneous Networks Using Deep Reinforcement Learning.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Sensors