Malware detection based on TF-(IDF&ICF) method

Bin Qin,Junpeng Zhang,Hongyu Chen

doi:10.1088/1742-6596/2024/1/012030

Bin Qin, Junpeng Zhang + Show 1 more

Open Access

https://doi.org/10.1088/1742-6596/2024/1/012030

Copy DOI

Journal: Journal of Physics: Conference Series	Publication Date: Sep 1, 2021
Citations: 4	License type: cc-by

Affiliation: Shenzhen University

Abstract

As the level of information technology continues to increase, information security problems caused by malware are becoming more and more serious. An important method to detect malware is to analyze software behavior information, such as permissions, API call sequences and system calls. In this paper, API call sequences are used as the research object for malware detection, and the traditional feature extraction methods are not ideal for API call sequences. In this paper, a TF-(IDF&ICF) feature extraction method is proposed by mathematical analysis, which combines document and category level features. Experiments show that using the feature extractor proposed in this paper, followed by training, the performance is improved in four different machine learning models, and the F1 can reach 0.979, while the system response time is significantly reduced, which has good practical value.

Full Text