Deep Multi-Task Conditional and Sequential Learning for Anti-Jamming

Robert Basomingera,Young-June Choi

doi:10.1109/access.2021.3109856

Abstract

Multi-task learning provides plenty of room for performance improvement to single-task learning, when learned tasks are related and learned with mutual information. In this work, we analyze the efficiency of using a single-task reinforcement learning algorithm to mitigate jamming attacks with frequency hopping strategy. Our findings show that single-task learning implementations do not always guarantee optimal cumulative reward when some jammer’s parameters are unknown, notably the jamming time-slot length in this case. Therefore, to maximize packet transmission in the presence of a jammer whose parameters are unknown, we propose deep multi-task conditional and sequential learning (DMCSL), a multi-task learning algorithm that builds a transition policy to optimize conditional and sequential tasks. For the anti-jamming system, the proposed model learns two tasks: sensing time and transmission channel selection. DMCSL is a composite of the state-of-the-art reinforcement learning algorithms, multi-armed bandit and an extended deep-Q-network. To improve the chance of convergence and optimal cumulative reward of the algorithm, we also propose a continuous action-space update algorithm for sensing time action-space. The simulation results show that DMCSL guarantees better performance than single-task learning by relying on a logarithmically increased action-space sample. Against a random dynamic jamming time-slot, DMCSL achieves about three times better cumulative reward, and against a periodic dynamic jamming time-slot, it improves by 10% the cumulative reward.

Highlights

I N wireless communication, network interference happens when nearby communicating nodes transmit at the same time with closer frequencies, resulting in a jamming attack if done intentionally
By following a decaying1, for each iteration, the agent explores the transmission in the environment to learn the jammer activities or exploits the transmission in the environment with the higher expected reward based on previously computed statistics, solved by calculating the optimal state-action value function using deep Q-network (DQN)
The analysis showed that designing anti-jamming learning systems as single-task learning for the transmission channel selection does not always guarantee optimal performance in the long run if the sensing time used is not optimal

Summary

INTRODUCTION

I N wireless communication, network interference happens when nearby communicating nodes transmit at the same time with closer frequencies, resulting in a jamming attack if done intentionally. We formulate anti-jamming as single-task learning in which the transmitter agent interacts with the environment (made of receiver, jammer, and other transmitter nodes) in a sequence of state S, action A, and reward r. By following a decaying , for each iteration, the agent explores the transmission in the environment to learn the jammer activities or exploits the transmission in the environment with the higher expected reward based on previously computed statistics, solved by calculating the optimal state-action value function using deep Q-network (DQN) Both the jammer and transmitter abide by the Assumption II.. When a transmitter agent does not know the jammer’s internal working details, solving jamming attacks as a single-task RL problem, which only learns about channel hopping policy, without optimizing sensing time, does not always guarantee the optimal accumulated reward, especially against a dynamic jamming time-slot.

33: End the episodes

ACTION-SPACE UPDATE ALGORITHM

ACTION-SPACE INCREASE ANALYSIS

COMPLEXITY ANALYSIS

SIMULATION RESULTS

Findings

CONCLUSION

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE Access	Publication Date: Jan 1, 2021
Citations: 1	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Deep Multi-Task Conditional and Sequential Learning for Anti-Jamming

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Access

Lead the way for us

Similar Papers

Knowledge triple mining via multi-task learning
Zhao Zhang ... Hui Xiong
Information Systems | VOL. 80
Zhao Zhang, et. al.Zhao Zhang ... Hui Xiong
19 Sep 2018
Information Systems | VOL. 80

Evaluation of multi-task learning in deep learning-based positioning classification of mandibular third molars
Shintaro Sukegawa ... Takeshi Hara
Scientific Reports | VOL. 12
Shintaro Sukegawa, et. al.Shintaro Sukegawa ... Takeshi Hara
13 Jan 2022
Scientific Reports | VOL. 12

Comparative Study of Single-task and Multi-task Learning on Research Protocol Document Classification
Abid Famasya Abdillah ... Riyanarto Sarno
-
Abid Famasya Abdillah, et. al.Abid Famasya Abdillah ... Riyanarto Sarno
20 Oct 2021
20 Oct 2021

Method for multi-task learning fusion network traffic classification to address small sample labels
Lan Liu ... Junhan Hu
Scientific Reports | VOL. 14
Lan Liu, et. al.Lan Liu ... Junhan Hu
30 Jan 2024
Scientific Reports | VOL. 14

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Deep Multi-Task Conditional and Sequential Learning for Anti-Jamming

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Access