Attention-Based Cross-Modal CNN Using Non-Disassembled Files for Malware Classification

Jeongwoo Kim, Eun-Sun Cho, Joon-Young Paik

doi:10.1109/access.2023.3253770

Open Access

https://doi.org/10.1109/access.2023.3253770

Abstract

Malware classification plays a crucial role in addressing the explosive increase in malware variants. By classifying malware instances into families, analysts can apply appropriate techniques and tools to handle the variants in each family. Using high-level representations of malware, such as disassembled code, yields meaningful classification performance. However, classification based on disassembled code depends on the practically implausible assumption that every malware sample is correctly reversed by disassemblers. Sophisticated malware with anti-disassembly capabilities seeks to confuse disassemblers, yielding incorrectly disassembled code. In this study, we focus on malware family classification that requires no disassembly and propose a new CNN-based malware classification model using non-disassembled malware files (i.e., binary files). Our model associates two modalities, “malware images” and “structural entropies,” which are converted and extracted from binary files. The two modalities have different granularities, bytes and chunks, that complement each other. The model adopts a cross-modal attention mechanism to combine the features of the two modalities by moderating their expressive limitations. We validate our model on three popular datasets: Kaggle Microsoft Malware Classification, Malimg, and BODMAS. The experimental results show that our model identifies malware families with higher accuracy than previous methods and without the burden of disassembly.
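As a rough illustration of the two modalities named in the abstract, the sketch below derives a byte-level "malware image" and a chunk-level entropy sequence from raw bytes. It is not the authors' pipeline: the function names, the image width of 16, the chunk size of 256 bytes, and the zero-padding of the last row are all assumptions made for this example, and the cross-modal attention step is not shown.

```python
import math
from collections import Counter
from typing import List


def bytes_to_image(data: bytes, width: int = 16) -> List[List[int]]:
    """Lay raw bytes out row by row as a grayscale image (one byte = one pixel)."""
    rows = []
    for start in range(0, len(data), width):
        row = list(data[start:start + width])
        row += [0] * (width - len(row))  # zero-pad the final row (assumption)
        rows.append(row)
    return rows


def structural_entropy(data: bytes, chunk_size: int = 256) -> List[float]:
    """Shannon entropy, in bits per byte, of each fixed-size chunk."""
    entropies = []
    for start in range(0, len(data), chunk_size):
        chunk = data[start:start + chunk_size]
        total = len(chunk)
        counts = Counter(chunk)
        h = -sum((c / total) * math.log2(c / total) for c in counts.values())
        entropies.append(h)
    return entropies


# Hypothetical stand-in for a binary file: one maximally varied chunk
# followed by one chunk of identical bytes.
sample = bytes(range(256)) + bytes(256)
image = bytes_to_image(sample, width=16)
entropy = structural_entropy(sample, chunk_size=256)
```

The image keeps every byte value in place, while the entropy sequence summarizes each chunk with a single number, which is one way to read the abstract's point that the two granularities complement each other.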

Journal: IEEE Access	Publication Date: Jan 1, 2023
Citations: 3	License type: CC BY-NC-ND 4.0

Similar Papers

Malware Family Classification using Active Learning by Learning
Chin-Wei Chen ... Ping-Hao Bair
01 Feb 2020

MalClassifier: Malware family classification using network flow sequence behaviour
Bushra A Alahmadi ... Ivan Martinovic
01 May 2018

Efficient Windows malware identification and classification scheme for plant protection information systems.
Zhiguo Chen ... Shuangshuang Xing
Frontiers in plant science | VOL. 14
15 Feb 2023

Semantic analysis and classification of malware for UNIX-like operating systems with the use of machine learning methods
Maksym V Mishchenko ... Mariia S Dorosh
Applied Aspects of Information Technology | VOL. 5
28 Dec 2022
