Dynamic spectrum access under partial observations: A restless bandit approach

Nima Akbarzadeh,Aditya Mahajan

doi:10.1109/cwit.2019.8929931

Abstract

We consider a communication system where multiple unknown channels are available for transmission. Each channel is a channel with state which evolves in a Markov manner. The transmitter has to select L channels to use and also decide the resources (e.g., power, rate, etc.) to use for each of the selected channels. It observes the state of the channels it uses and receives no feedback on the state of the other channels. We model this problem as a partially observable Markov decision process and obtain a simplified belief state. We show that the optimal resource allocation policy can be identified in closed form. Once the optimal resource allocation policy is fixed, choosing the channel scheduling policy may be viewed as a restless bandit. We present an efficient algorithm to check indexability and compute the Whittle index for each channel. When the model is indexable, the Whittle index policy, which transmits over the L channels with the smallest Whittle indices, is an attractive heuristic policy.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Dynamic spectrum access under partial observations: A restless bandit approach

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Towards Q-learning the Whittle Index for Restless Bandits
Jing Fu ... Sarat Moka
-
Jing Fu, et. al.Jing Fu ... Sarat Moka
01 Nov 2019
01 Nov 2019

Reinforcement Learning Based QoS-Provisioning over Energy-Harvesting 5G Wireless Ad-Hoc Networks
Xi Zhang ... H Vincent Poor
-
Xi Zhang, et. al.Xi Zhang ... H Vincent Poor
01 Dec 2019
01 Dec 2019

Tabular and Deep Learning for the Whittle Index
Francisco Robledo Relaño ... Konstantin Avrachenkov
ACM Transactions on Modeling and Performance Evaluation of Computing Systems | VOL. 9
Francisco Robledo Relaño, et. al.Francisco Robledo Relaño ... Konstantin Avrachenkov
13 Aug 2024
ACM Transactions on Modeling and Performance Evaluation of Computing Systems | VOL. 9

Channel probing for opportunistic access with multi-channel sensing
Keqin Liu ... Qing Zhao
-
Keqin Liu, et. al.Keqin Liu ... Qing Zhao
01 Oct 2008
01 Oct 2008

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Dynamic spectrum access under partial observations: A restless bandit approach

Abstract

Talk to us

Similar Papers