Decompiled APK based malicious code classification

Roni Mateless, Daniel Rejabek, Oded Margalit, Robert Moskovitch

doi:10.1016/j.future.2020.03.052

Abstract

As the variety of Android malware grows, it is increasingly important to distinguish between its different types. In this paper, we introduce the use of decompiled source code for malicious code classification. Decompiled source code provides opportunities for deeper analysis and a better understanding of the nature of malware. Malicious code differs from natural-language text because of the syntax rules of compilers and the efforts of attackers to evade detection. Hence, we adapt Natural Language Processing-based techniques, under some constraints, for malicious code classification. The proposed methodology first decompiles the Android Package Kit (APK) files; API calls, keywords, and non-obfuscated tokens are then extracted from the source code and categorized into stop-tokens, feature-tokens, and long-tail-tokens. We also introduce the use of generalized N-tokens to represent tokens that are typically less frequent. We evaluated our approach against a baseline that uses API calls and permissions as features, and their combination, as well as against neural network architectures based on decompiled APKs. A rigorous evaluation was performed on comprehensive public real-world Android malware datasets, including 24,553 apps categorized into 71 families for malicious family classification, and 60,000 apps for malicious code detection. Our approach outperformed the baselines in both tasks.
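The token categorization step can be illustrated with a minimal sketch. The abstract does not say how tokens are assigned to the three categories, so this sketch assumes a simple document-frequency rule: tokens that appear in nearly every app are treated as stop-tokens, tokens that appear in very few apps as long-tail-tokens, and the rest as feature-tokens. The function names, the identifier regex, and the `stop_ratio` and `tail_ratio` thresholds are all hypothetical and are not taken from the paper; obfuscation filtering and generalized N-tokens are not modeled.

```python
import re
from collections import Counter

# Assumption: a token is any Java-style identifier or keyword.
TOKEN_RE = re.compile(r"[A-Za-z_][A-Za-z0-9_]*")


def tokenize(source):
    """Split decompiled source text into identifier-like tokens."""
    return TOKEN_RE.findall(source)


def document_frequency(docs):
    """Count, for each token, the number of documents (apps) containing it."""
    df = Counter()
    for source in docs:
        df.update(set(tokenize(source)))
    return df


def categorize_tokens(docs, stop_ratio=0.9, tail_ratio=0.25):
    """Assign each token to 'stop', 'feature', or 'long_tail'.

    The thresholds are illustrative assumptions, not values from the paper.
    """
    n_docs = len(docs)
    categories = {"stop": set(), "feature": set(), "long_tail": set()}
    for token, count in document_frequency(docs).items():
        ratio = count / n_docs
        if ratio >= stop_ratio:
            categories["stop"].add(token)       # present almost everywhere
        elif ratio <= tail_ratio:
            categories["long_tail"].add(token)  # rare, e.g. one-off names
        else:
            categories["feature"].add(token)    # potentially discriminative
    return categories


if __name__ == "__main__":
    # Toy stand-ins for decompiled sources; real input would come from a decompiler.
    apps = [
        "public void onCreate() { sendTextMessage(number, body); }",
        "public void onCreate() { getDeviceId(); sendTextMessage(number, body); }",
        "public void onCreate() { getDeviceId(); }",
        "public void onCreate() { zzQx(); }",
    ]
    for name, tokens in categorize_tokens(apps).items():
        print(name, sorted(tokens))
```

In a full pipeline, the feature-tokens would typically feed a bag-of-words style classifier, while the rare tokens would need some form of generalization, which is the role the abstract gives to generalized N-tokens.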

Journal: Future Generation Computer Systems
Publication Date: Apr 11, 2020
Citations: 15

Similar Papers

Extensible Android Malware Detection and Family Classification Using Network-Flows and API-Calls
Laya Taheri ... Arash Habibi Lashkari
01 Oct 2019

Malware detection on android smartphones using keywords vector and SVM
Junmei Sun ... Kai Yan
01 May 2017

The rise of obfuscated Android malware and impacts on detection methods.
Wael F Elsersy ... Ali Feizollah
PeerJ Computer Science | VOL. 8
09 Mar 2022

DroidMat: Android Malware Detection through Manifest and API Calls Tracing
Dong-Jie Wu ... Ching-Hao Mao
01 Aug 2012
