Handling Labeled Data Insufficiency: Semi-supervised Learning with Self-Training Mixup Decision Tree for Classification of Network Attacking Traffic

Yubo Hou,Chee-Keong Kwoh,Min Wu,Zhenghua Chen,Tram Truong-Huu,Sin G Teo

doi:10.1109/tdsc.2022.3195534

Abstract

Motivated by the fast advancements in artificial intelligence (AI) technologies, recent research has moved towards using machine learning and deep learning to detect and classify security attacks in computer networks. However, most prior works adopt supervised learning methods, and the performance heavily depends on the amount of labeled data used to train the detection models. Network attack detection and classification is not an exception due to the lack of labeled data, especially the attacking traffic, which is much less than the regular (legitimate) traffic. Yet, labeling network traffic is also challenging and requires specific domain expertise. This paper proposes an efficient semi-supervised learning method for the classification of network attacking traffic, known as Self-Training Mixup Decision Tree (STM-DT). STM-DT first trains a decision tree on a small amount of labeled data and then uses the obtained model to predict labels of unlabeled samples. Some noisy labels will be removed by consistency. The predicted samples will then be mixed with labeled samples using <monospace>mixup</monospace> to train a new decision tree, which is the final desired classifier. We evaluate STM-DT using four network traffic datasets. Experimental results demonstrate that the proposed STM-DT method achieves higher macro F1 scores over different minority labeled data percentages.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Handling Labeled Data Insufficiency: Semi-supervised Learning with Self-Training Mixup Decision Tree for Classification of Network Attacking Traffic

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Dependable and Secure Computing

Lead the way for us

Journal: IEEE Transactions on Dependable and Secure Computing	Publication Date: Jan 1, 2024
Citations: 5

Similar Papers

Response to M. Trengove & coll regarding "Attention is not all you need: the complicated case of ethically using large language models in healthcare and medicine".
Stefan Harrer
eBioMedicine | VOL. 93
Stefan HarrerStefan Harrer
01 Jul 2023
eBioMedicine | VOL. 93

A Novel Approach for Data Collection and Network Attack Warning
Van Kha Nguyen ... Thanh Hai Nguyen
-
Van Kha Nguyen, et. al.Van Kha Nguyen ... Thanh Hai Nguyen
01 Oct 2019
01 Oct 2019

ChatGPT Isn't Magic
Tama Leaver ... Suzanne Srdarov
M/C Journal | VOL. 26
Tama Leaver, et. al.Tama Leaver ... Suzanne Srdarov
02 Oct 2023
M/C Journal | VOL. 26

Detection and classification of network attacks using the deepneural network cascade
Irina M Shpinareva ... Lyudmila A Voloshchuk
Herald of Advanced Information Technology | VOL. 4
Irina M Shpinareva, et. al.Irina M Shpinareva ... Lyudmila A Voloshchuk
15 Oct 2021
Herald of Advanced Information Technology | VOL. 4

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Handling Labeled Data Insufficiency: Semi-supervised Learning with Self-Training Mixup Decision Tree for Classification of Network Attacking Traffic

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Dependable and Secure Computing