Malware Classification Based on Semi-Supervised Learning

Yu Ding,Zisen Qi,Haiping Wang,Jian Xing,Xiaoyu Zhang,Siyu Jia,Menghan Guo,Binbin Li,Qian Qiang

doi:10.1007/978-3-031-17551-0_19

Abstract

AbstractWith the rapid evolution of malware in the past few years, it caused serious threats and damage to network security. To handle this, researchers began to propose effective classification approaches for various malware variants. However, these widely-used methods based on deep learning are in fully supervised manner, which suffers from two inevitable problems: 1) time-consuming: manually labeling data before training fully-supervised models require huge manual efforts. 2) resource-redundancy: a large amount of unlabeled data is not fully used, resulting in a resource waste. To solve the above problems, in this paper we propose a Malware Classification Method based on Semi-Supervised Learning namely MCM-SSL, which divides the model training into a pre-train stage using unlabeled data and a finetune stage using labeled data. The method proposed in this paper effectively uses a large amount of unlabeled data, and only needs a small amount of labeled data to achieve excellent performance. As a result, our method achieves an accuracy of 90.51% on the open-source Virus-MNIST dataset, which is superior to recent state-of-the-art methods. We also verify the generality and robustness of our method using a variety of common neural network algorithms. For the same algorithm, the accuracy of the pre-trained model is on average 2.4% higher than the model without pre-training.KeywordsMalware classificationSemi-supervised learningContrastive learning

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Malware Classification Based on Semi-Supervised Learning

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

A Novel Two-Stage Unsupervised Fault Recognition Framework Combining Feature Extraction and Fuzzy Clustering for Collaborative AIoT
Xufeng Hu ... Lei Jia
IEEE Transactions on Industrial Informatics | VOL. 18
Xufeng Hu, et. al.Xufeng Hu ... Lei Jia
29 Apr 2021
IEEE Transactions on Industrial Informatics | VOL. 18

Threshold Filtering Semi-Supervised Learning Method for SAR Target Recognition
Linshan Shen ... Ye Tian
Computers, Materials & Continua | VOL. 73
Linshan Shen, et. al.Linshan Shen ... Ye Tian
01 Jan 2021
Computers, Materials & Continua | VOL. 73

Semi-MCNN: A Semisupervised Multi-CNN Ensemble Learning Method for Urban Land Cover Classification Using Submeter HRRS Images
Runyu Fan ... Lizhe Wang
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing | VOL. 13
Runyu Fan, et. al.Runyu Fan ... Lizhe Wang
01 Jan 2020
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing | VOL. 13

A review of research and development of semi-supervised learning strategies for medical image processing
Shengke Yang
EAI Endorsed Transactions on e-Learning | VOL. 9
Shengke YangShengke Yang
16 Jan 2024
EAI Endorsed Transactions on e-Learning | VOL. 9

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Malware Classification Based on Semi-Supervised Learning

Abstract

Talk to us

Similar Papers