Con2Mix: A semi-supervised method for imbalanced tabular security data1

Xiaodi Li,Kevin Hamlen,Shamila Wickramasuriya,Mahmoud Zamani,Bhavani Thuraisingham,Latifur Khan

doi:10.3233/jcs-220130

Abstract

Con2Mix (Contrastive Double Mixup) is a new semi-supervised learning methodology that innovates a triplet mixup data augmentation approach for finding code vulnerabilities in imbalanced, tabular security data sets. Tabular data sets in cybersecurity domains are widely known to pose challenges for machine learning because of their heavily imbalanced data (e.g., a small number of labeled attack samples buried in a sea of mostly benign, unlabeled data). Semi-supervised learning leverages a small subset of labeled data and a large subset of unlabeled data to train a learning model. While semi-supervised methods have been well studied in image and language domains, in security domains they remain underutilized, especially on tabular security data sets which pose especially difficult contextual information loss and balance challenges for machine learning. Experiments applying Con2Mix to collected security data sets show promise for addressing these challenges, achieving state-of-the-art performance on two evaluated data sets compared with other methods.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Con2Mix: A semi-supervised method for imbalanced tabular security data1

Abstract

Talk to us

Similar Papers

More From: Journal of Computer Security

Lead the way for us

Journal: Journal of Computer Security	Publication Date: Nov 10, 2023
Citations: 2

Similar Papers

MCoM: A Semi-Supervised Method for Imbalanced Tabular Security Data
Xiaodi Li ... Bhavani Thuraisingham
-
Xiaodi Li, et. al.Xiaodi Li ... Bhavani Thuraisingham
01 Jan 2021
01 Jan 2021

2MiCo: A Contrastive Semi-Supervised Method with Double Mixup for Smart Meter Modbus RS-485 Communication Security
Xiaodi Li ... Md Delwar Hossain
-
Xiaodi Li, et. al.Xiaodi Li ... Md Delwar Hossain
01 May 2023
2MiCo: A Contrastive Semi-Supervised Method with Double Mixup for Smart Meter Modbus RS-485 Communication Security
Xiaodi Li ... Md Delwar Hossain

Perturbation of deep autoencoder weights for model compression and classification of tabular data
Sakib Abrar ... Manar D Samad
Neural networks : the official journal of the International Neural Network Society | VOL. 156
Sakib Abrar, et. al.Sakib Abrar ... Manar D Samad
27 Sep 2022
Neural networks : the official journal of the International Neural Network Society | VOL. 156

A topological approach for semi-supervised learning
A Inés ... J Rubio
Journal of Computational Science | VOL. 82
A Inés, et. al.A Inés ... J Rubio
03 Aug 2024
Journal of Computational Science | VOL. 82

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Con2Mix: A semi-supervised method for imbalanced tabular security data1

Abstract

Talk to us

Similar Papers

More From: Journal of Computer Security