Extracting the Representative API Call Patterns of Malware Families Using Recurrent Neural Network

Iltaek Kwon,Eul Gyu Im

doi:10.1145/3129676.3129712

Abstract

With thousands of malware samples pouring out every day, how can we reduce malware analysis time and detect them effectively? Malware family classification provides one of good measures to predict characteristics of unknown malware since malware belonging to the same family can have similar features. Static analysis and dynamic analysis are techniques to obtain features to be used for classifying malware samples to their families. Static analysis performs analysis based on specific signatures included in the malware. Static analysis has the advantages that the scope of the analysis covers the entire code, and the analysis can be performed without executing the malware. However, it is very difficult to detect or classify malware variants with only the results of the static analysis, because malware developers use polymorphic or encryption techniques to avoid static analysis-based detection of anti-virus software. Dynamic analysis analyzes malware behaviors, so the results of dynamic analysis can be used to detect or classify malware variants. One of dynamic features that can be used to detect or classify malware variants is API call sequences. In this paper, we propose a novel method to extract representative API call patterns of malware families using Recurrent Neural Network (RNN). We conducted experiments with 787 malware samples belonging to 9 families. In our experiments, we extracted representative API call patterns of 9 malware families on 551 samples as a training set and performed classification on the 236 samples as a test set. Classification accuracy results using API call patterns extracted from RNN were measured as 71% on average. The results show the feasibility of our approach using RNN to extract representative API call pattern of malware families for malware family classification.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Extracting the Representative API Call Patterns of Malware Families Using Recurrent Neural Network

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

MAlign: Explainable static raw-byte based malware family classification using sequence alignment
Shoumik Saha ... Atif Hasan Rahman
Computers & Security | VOL. 139
Shoumik Saha, et. al.Shoumik Saha ... Atif Hasan Rahman
17 Jan 2024
Computers & Security | VOL. 139

A Hybrid Approach for Malware Family Classification
Naqqash Aman ... Farrukh Shahzad
-
Naqqash Aman, et. al.Naqqash Aman ... Farrukh Shahzad
01 Jan 2017
01 Jan 2017

Unsuccessful story about few shot malware family classification and siamese network to the rescue
Yude Bai ... Xiaohong Li
-
Yude Bai, et. al.Yude Bai ... Xiaohong Li
27 Jun 2020
27 Jun 2020

HDM-Analyser: a hybrid analysis approach based on data mining techniques for malware detection
Mojtaba Eskandari ... Sattar Hashemi
Journal of Computer Virology and Hacking Techniques | VOL. 9
Mojtaba Eskandari, et. al.Mojtaba Eskandari ... Sattar Hashemi
17 Feb 2013
Journal of Computer Virology and Hacking Techniques | VOL. 9

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Extracting the Representative API Call Patterns of Malware Families Using Recurrent Neural Network

Abstract

Talk to us

Similar Papers