Abstract
In order to maximize energy efficiency in heterogeneous networks (HetNets), turbo Q-Learning (TQL), which combines a multistage decision process with tabular Q-Learning, is proposed to optimize the resource configuration. To handle the large action space, the energy efficiency optimization problem is formulated in this paper as a multistage decision process: according to the resource allocation of the optimization objectives, the initial problem is divided into several subproblems, each solved by tabular Q-Learning, so that the traditionally exponential growth of the action space is reduced to linear growth. By iterating over the solutions of the subproblems, the initial problem is solved, and a simple stability analysis of the algorithm is given. As for the large state space, a deep neural network (DNN) is used to classify states, with the optimization policy of the novel Q-Learning serving to label the training samples. In this way, the dimensionality of both the action space and the state space is addressed. Simulation results show that the proposed approach converges, improves convergence speed by 60% while maintaining almost the same energy efficiency, and supports system adjustment.
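The action-space decomposition described above can be illustrated with a minimal sketch. All names and values here are hypothetical: a separable toy reward stands in for the energy-efficiency objective, and deterministic greedy backups stand in for the epsilon-greedy updates of full tabular Q-Learning. The point is the bookkeeping: one joint table over K resource types with M levels each needs M^K entries, while K per-subproblem tables need only K*M, and looping over the subproblems recovers the joint optimum.

```python
# Toy sketch of the TQL decomposition (all names/values hypothetical):
# a joint action over K resource types with M levels each is optimized
# by K small tabular Q-learners iterated in a loop, instead of one
# exponentially large joint Q-table.

M, K = 8, 3                            # 8 power levels, 3 resource types (assumed)
print(M ** K, K * M)                   # 512 24  -> exponential vs linear table size

# Hypothetical separable "energy efficiency", peaking at a = (2, 5, 1)
def reward(a):
    return -sum((x - t) ** 2 for x, t in zip(a, (2, 5, 1)))

Q = [[0.0] * M for _ in range(K)]      # one small Q-table per subproblem
a = [0] * K                            # current joint resource configuration
for sweep in range(3):                 # loop-iteration structure of TQL
    for k in range(K):                 # solve subproblem k, others held fixed
        for act in range(M):
            trial = a[:k] + [act] + a[k + 1:]
            Q[k][act] = reward(trial)  # greedy backup stands in for
                                       # epsilon-greedy Q-learning updates
        a[k] = max(range(M), key=Q[k].__getitem__)
print(a)                               # [2, 5, 1] -- the joint optimum
```

For a separable reward, a single sweep already reaches the optimum; for coupled rewards, repeated sweeps play the role of the loop iteration between sub-Q-Learning blocks.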
Highlights
With the dramatically growing number of wireless devices, more stringent requirements are placed on the performance and energy efficiency of heterogeneous networks (HetNets) [1]
Turbo Q-Learning (TQL) is proposed by combining traditional Q-Learning with a multistage decision process in a loop-iteration structure, where each sub-Q-Learning solves one subproblem derived from the original optimization problem
In jointly optimizing resources to maximize the energy efficiency of HetNets by reinforcement learning (RL), the excessively large action and state spaces must be addressed
Summary
With the dramatically growing number of wireless devices, more stringent requirements are placed on the performance and energy efficiency of heterogeneous networks (HetNets) [1]. In this paper, inspired by previous works [2,4,5,6,7,8,9,10] on RL and on converting non-convex NP-hard problems into several subproblems, a turbo Q-Learning (TQL) scheme is proposed to optimize energy efficiency: the traditional Q-Learning algorithm is decomposed into several sub-Q-Learning algorithms arranged in a loop-iteration structure, with each sub-Q-Learning solving one subproblem of the original optimization problem. This effectively deals with the dimensional explosion caused by the growing action space in RL and greatly reduces the complexity of the optimization problem.
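The abstract's second ingredient, using the TQL policy to label samples for a DNN state classifier, can be sketched as follows. Everything here is a hypothetical stand-in: a simple threshold rule plays the role of the trained Q-Learning policy, states are assumed 2-dimensional, and a small one-hidden-layer network (written out in NumPy) learns to map unseen states to the policy's greedy action.

```python
import numpy as np
np.random.seed(0)

# Stand-in for the trained TQL policy argmax_a Q(s, a) (assumed rule):
def tql_policy(S):
    return (S[:, 0] > 0.5).astype(int)

X = np.random.rand(400, 2)         # sampled network states (assumed 2-D)
y = tql_policy(X)                  # labels produced by the Q-Learning policy

# Tiny one-hidden-layer classifier trained by full-batch gradient descent
W1 = np.random.randn(2, 16) * 0.5; b1 = np.zeros(16)
W2 = np.random.randn(16, 2) * 0.5; b2 = np.zeros(2)
lr = 0.5
for epoch in range(500):
    h = np.tanh(X @ W1 + b1)                       # hidden layer
    logits = h @ W2 + b2
    p = np.exp(logits - logits.max(axis=1, keepdims=True))
    p /= p.sum(axis=1, keepdims=True)              # softmax probabilities
    grad = p.copy(); grad[np.arange(len(y)), y] -= 1
    grad /= len(y)                                 # dL/dlogits, cross-entropy
    dW2 = h.T @ grad; db2 = grad.sum(0)
    dh = grad @ W2.T * (1 - h ** 2)                # backprop through tanh
    dW1 = X.T @ dh; db1 = dh.sum(0)
    W2 -= lr * dW2; b2 -= lr * db2; W1 -= lr * dW1; b1 -= lr * db1

Xt = np.random.rand(200, 2)                        # unseen states
pred = np.argmax(np.tanh(Xt @ W1 + b1) @ W2 + b2, axis=1)
acc = (pred == tql_policy(Xt)).mean()
print(f"test accuracy: {acc:.2f}")                 # typically close to 1.0
```

Once trained, the classifier generalizes the tabular policy to states never visited during Q-Learning, which is how the large state-space dimension is handled.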