Efficient Online Learning for Cognitive Radar-Cellular Coexistence via Contextual Thompson Sampling

Charles E Thornton,Anthony F Martone,R Michael Buehrer

doi:10.1109/globecom42002.2020.9322256

Abstract

This paper describes a sequential, or online, learning scheme for adaptive radar transmissions that facilitate spectrum sharing with a non-cooperative cellular network. First, the interference channel between the radar and a spatially distant cellular network is modeled. Then, a linear Contextual Bandit (CB) learning framework is applied to drive the radar’s behavior. The fundamental trade-off between exploration and exploitation is balanced by a proposed Thompson Sampling (TS) algorithm, a pseudo-Bayesian approach which selects waveform parameters based on the posterior probability that a specific waveform is optimal, given discounted channel information as context. It is shown that the contextual TS approach converges more rapidly to behavior that minimizes mutual interference and maximizes spectrum utilization than comparable online learning algorithms. Additionally, it is shown that the TS learning scheme results in a favorable SINR distribution compared to other online learning algorithms. Finally, the proposed TS algorithm is compared to a deep reinforcement learning model. Simulation results show that the TS algorithm maintains competitive performance with a more complex Deep Q-Network (DQN).

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Efficient Online Learning for Cognitive Radar-Cellular Coexistence via Contextual Thompson Sampling

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Performance Comparison of UCB, TS, and -Greedy TS Algorithms through Simulation of Multi-Armed Bandit Machine
Zhuoran Liu
Applied and Computational Engineering | VOL. 83
Zhuoran LiuZhuoran Liu
31 Oct 2024
Applied and Computational Engineering | VOL. 83

Adaptive OFDM Based on Thompson Sampling Algorithm without Channel Knowledge
Haipeng Luo ... Ruixuan He
-
Haipeng Luo, et. al.Haipeng Luo ... Ruixuan He
23 Sep 2022
23 Sep 2022

Program Placement Optimization for Storage-constrained Mobile Edge Computing Systems: A Multi-armed Bandit Approach
Mingjie Feng ... Marwan Krunz
-
Mingjie Feng, et. al.Mingjie Feng ... Marwan Krunz
01 Jun 2021
01 Jun 2021

Collaborative Thompson Sampling
Zhenyu Zhu ... Hongli Xu
-
Zhenyu Zhu, et. al.Zhenyu Zhu ... Hongli Xu
01 Jan 2019
01 Jan 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Efficient Online Learning for Cognitive Radar-Cellular Coexistence via Contextual Thompson Sampling

Abstract

Talk to us

Similar Papers