Empirical Study on Intelligent Android Malware Detection based on Supervised Machine Learning

Talal A.A Abdullah,Waleed Ali,Rawad Abdulghafor

doi:10.14569/ijacsa.2020.0110429

Talal A.A Abdullah, Waleed Ali + Show 1 more

Open Access

https://doi.org/10.14569/ijacsa.2020.0110429

Copy DOI

Abstract

The increasing number of mobile devices using the Android operating system in the market makes these devices the first target for malicious applications. In recent years, several Android malware applications were developed to perform certain illegitimate activities and harmful actions on mobile devices. In response, specific tools and anti-virus programs used conventional signature-based methods in order to detect such Android malware applications. However, the most recent Android malware apps, such as zero-day, cannot be detected through conventional methods that are still based on fixed signatures or identifiers. Therefore, the most recently published research studies have suggested machine learning techniques as an alternative method to detect Android malware due to their ability to learn and use the existing information to detect the new Android malware apps. This paper presents the basic concepts of Android architecture, Android malware, and permission features utilized as effective malware predictors. Furthermore, a comprehensive review of the existing static, dynamic, and hybrid Android malware detection approaches is presented in this study. More significantly, this paper empirically discusses and compares the performances of six supervised machine learning algorithms, known as K-Nearest Neighbors (K-NN), Decision Tree (DT), Support Vector Machine (SVM), Random Forest (RF), Naïve Bayes (NB), and Logistic Regression (LR), which are commonly used in the literature for detecting malware apps.

Highlights

Android constitutes the most common mobile operating system [1] that presently dominates the smartphone market
It compares and discusses the performances of six supervised machine learning algorithms, which are commonly used in the literature for detecting malware apps, known as K-Nearest Neighbors (K-NN), Decision Tree (DT), Support Vector Machine (SVM), Random Forest (RF), Naïve Bayes (NB), and Logistic Regression (LR)
The Android operating system is a stack of components that can be defined as consisting of five layers that organize the functions of the system in the form of the Linux kernel layer, hardware abstractor layer, Android libraries layer, Java application program interfaces (API) framework layer, and system application layer

Summary

INTRODUCTION

Android constitutes the most common mobile operating system [1] that presently dominates the smartphone market. Many Android commercial tools and antivirus programs have been developed to detect android malware applications Most of these commercial Android malware detection tools are based on using fixed signatures or identifiers. These commercial tools, only perform well in detecting the Android malware applications with known signatures or identifiers and may fail to detect the unknown Android malware apps [5] that have been developed more recently, especially zero-day malware apps In other words, these commercial tools are unable to make accurate decisions when determining whether the new Android app is a malware or not [6][7]. Numerous research works [8][9][4][10] focused on training machine learning classification algorithms based on known Android malware apps in order to detect unknown Android malware applications.

RELATED WORK

Intelligent Android Malware Detection Approach based on Static Analysis

Intelligent Android Malware Detection Approach based on Dynamic Analysis

Intelligent Android Malware Detection Approach based on Hybrid Analysis

Other Advanced Intelligent Techniques

SUMMARY OF CONTRIBUTIONS

ANDROID ARCHITECTURE

The Linux Kernel

Hardware Abstractor Layer

Android Libraries

Java API Framework All Android OS features that are available for use through

ANDROID MALWARE

SUPERVISED MACHINE LEARNING

K- Nearest Neighbours

Decision Trees

Support Vector Machine

Random Forest

Naïve Bayes

METHODOLOGY

Data Collection

Feature Extraction

Training of Classification Models

Performance Evaluation

Experiments Environment

Evaluation Methods and Measures

Discussion

CONCLUSION AND FUTURE WORK

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: International Journal of Advanced Computer Science and Applications	Publication Date: Jan 1, 2020
Citations: 9	License type: cc-by

R Discovery Prime

R Discovery Prime

Empirical Study on Intelligent Android Malware Detection based on Supervised Machine Learning

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: International Journal of Advanced Computer Science and Applications

Lead the way for us

Similar Papers

Classification and Analysis of Android Malware Images Using Feature Fusion Technique
Tanya Gera ... Deepak Thakur
IEEE Access | VOL. 9
Tanya Gera, et. al.Tanya Gera ... Deepak Thakur
01 Jan 2020
IEEE Access | VOL. 9

MUDROID: Android malware detection and classification based on permission and behavior for autonomous vehicles
Bochang Wang ... Hai Da
Transactions on Emerging Telecommunications Technologies | VOL. 34
Bochang Wang, et. al.Bochang Wang ... Hai Da
07 Aug 2023
Transactions on Emerging Telecommunications Technologies | VOL. 34

Android Malware Classification Using Optimized Ensemble Learning Based on Genetic Algorithms
Omar Barukab ... Altyeb Taha
Sustainability | VOL. 14
Omar Barukab, et. al.Omar Barukab ... Altyeb Taha
03 Nov 2022
Sustainability | VOL. 14

An Automated Vision-Based Deep Learning Model for Efficient Detection of Android Malware Attacks
Walid El-Shafai ... Iman Almomani
IEEE Access | VOL. 10
Walid El-Shafai, et. al.Walid El-Shafai ... Iman Almomani
01 Jan 2021
IEEE Access | VOL. 10

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Empirical Study on Intelligent Android Malware Detection based on Supervised Machine Learning

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: International Journal of Advanced Computer Science and Applications