Opcode and API Based Machine Learning Framework For Malware Classification

Hrishabh Soni, Durga Prasad Mohapatra, Pushkar Kishore

doi:10.1109/conit55038.2022.9848152

Abstract

Traditional machine learning (ML) based malware detectors depend on hand-crafted features, which fail on recent malware. Deep learning (DL) based solutions address this issue but require long training times. The real challenge is designing an ML-based malware detector that achieves a higher F1-score. In this paper, we present a novel framework that classifies malware using two kinds of features: opcodes and application programming interface (API) calls. First, API calls and opcodes are extracted with Interactive Disassembler Pro (IDA Pro) from the assembly language source code (ALSC) files of the malicious samples. Then, the continuous n-gram technique is applied to the extracted API calls and opcodes to create the dataset's features. The value of each feature in a row is its frequency in the behaviors extracted from the corresponding sample. We scale the values in the dataset using the term frequency-inverse document frequency (TF-IDF) methodology. The best combination of n-gram and feature selection techniques is identified for the API-based and opcode-based datasets. The final label of a malicious sample is decided by the highest probability among the detections made by the API-based and opcode-based detectors. For analysis, an off-the-shelf dataset named Microsoft Malware is used. We achieve an F1-score of 96% for the API-based detector and an F1-score of 98% for the opcode-based detector. Our framework achieves an overall F1-score of 99.3%, better than recent state-of-the-art techniques. Apart from attaining a higher F1-score, the framework reduces training time because it uses ML techniques instead of DL techniques.
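The feature pipeline described in the abstract (continuous n-grams, frequency counts, TF-IDF scaling, and a highest-probability decision between two detectors) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names, the smoothed TF-IDF variant, the toy opcode sequences, and the labels `family_a` and `family_b` are all assumptions chosen for illustration, and the classifiers that would produce the per-class probabilities are omitted.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Return the contiguous n-grams of a token sequence as tuples."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def count_matrix(samples, n):
    """Count n-gram frequencies per sample; returns (vocabulary, rows)."""
    counts = [Counter(ngrams(s, n)) for s in samples]
    vocab = sorted(set().union(*counts))
    rows = [[c[g] for g in vocab] for c in counts]
    return vocab, rows


def tfidf(rows):
    """Scale raw counts by a smoothed IDF weight (one common variant,
    assumed here; the abstract does not specify which formula is used)."""
    n_docs = len(rows)
    n_feats = len(rows[0]) if rows else 0
    df = [sum(1 for r in rows if r[j] > 0) for j in range(n_feats)]
    idf = [math.log((1 + n_docs) / (1 + d)) + 1 for d in df]
    return [[r[j] * idf[j] for j in range(n_feats)] for r in rows]


def fuse_by_max_probability(api_probs, opcode_probs):
    """Pick the label holding the single highest probability across
    both detectors (our reading of the abstract's decision rule)."""
    best_label, best_p = None, -1.0
    for probs in (api_probs, opcode_probs):
        for label, p in probs.items():
            if p > best_p:
                best_label, best_p = label, p
    return best_label


# Toy opcode sequences standing in for two disassembled samples.
opcode_samples = [
    ["push", "mov", "call", "mov"],
    ["mov", "mov", "call", "ret"],
]
vocab, rows = count_matrix(opcode_samples, n=2)
scaled = tfidf(rows)

# Hypothetical per-class probabilities from the two detectors.
label = fuse_by_max_probability(
    {"family_a": 0.55, "family_b": 0.45},
    {"family_a": 0.10, "family_b": 0.90},
)
```

In this sketch an n-gram that occurs in every sample receives the smallest IDF weight, so features shared by all samples contribute less than rarer ones. The feature selection step mentioned in the abstract would sit between the scaling and the classifiers and is left out here.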

