Malware Detection Inside App Stores Based on Lifespan Measurements

Carlos Cilleruelo, Jose-Javier Martinez-Herraiz, Enrique-Larriba Enrique-Larriba, Luis De-Marcos

doi:10.1109/access.2021.3107903

Open Access

https://doi.org/10.1109/access.2021.3107903

Journal: IEEE Access
Publication Date: Jan 1, 2021
Citations: 5
License type: CC BY 4.0

Affiliation: University of Alcalá

Abstract

Potentially Harmful Apps (PHAs), like any other type of malware, are a problem in the modern Android ecosystem. Even though Google tries to maintain a clean app ecosystem, the Google Play Store is still one of the main vectors for spreading PHAs. In this paper, we propose a solution based on machine learning algorithms to detect PHAs inside application markets. Because application markets are one of the main entry vectors, a solution capable of detecting PHAs submitted or in submission to those markets is needed. The proposed solution can detect PHAs inside an application market and can be used as a filtering method to automatically block the publication of novel PHAs. It is based on static analysis of applications; although several static analysis solutions have already been developed, the innovation of this system lies in its training and in the creation of its dataset. We have created a new dataset that uses the lifespan of an application inside Google Play as its criterion: the shorter the time an application is active inside an application market, the higher the probability that it is a PHA. This criterion was chosen to avoid relying on antivirus engines, and inheriting their bias, for detecting malware. By using lifespan as the criterion, we created a new detection method that does not replicate any existing antivirus engine. Experimental results show that this solution obtains 90% accuracy on a dataset of 91,203 applications published on the Google Play Store. Although this accuracy is lower than that of other machine learning models focused on detecting PHAs, it is necessary to take into account that this is a complementary and different method. The presented work can be combined with other static and dynamic machine learning models, since its training, based on lifespan measurements, is drastically different.
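The lifespan criterion described in the abstract can be illustrated with a minimal sketch. The 90-day cutoff, the `AppRecord` fields, and the `label_by_lifespan` helper are illustrative assumptions: the abstract does not state the threshold or the data layout the authors used, so this is one plausible reading of the idea rather than their pipeline.

```python
from dataclasses import dataclass
from datetime import date
from typing import Optional

# Hypothetical cutoff: the abstract does not give the threshold the authors used.
SHORT_LIFESPAN_DAYS = 90


@dataclass
class AppRecord:
    package: str              # hypothetical package name
    published: date           # day the app appeared in the store
    removed: Optional[date]   # None if still listed at observation time


def lifespan_days(app: AppRecord, observed: date) -> int:
    """Days the app was listed, up to its removal or the observation date."""
    end = app.removed if app.removed is not None else observed
    return (end - app.published).days


def label_by_lifespan(app: AppRecord, observed: date) -> int:
    """Return 1 (suspected PHA) for apps removed after a short lifespan, else 0."""
    if app.removed is None:
        return 0  # still listed: treated as benign in this sketch
    return 1 if lifespan_days(app, observed) < SHORT_LIFESPAN_DAYS else 0


observed = date(2021, 1, 1)
apps = [
    AppRecord("com.example.shortlived", date(2020, 6, 1), date(2020, 6, 20)),
    AppRecord("com.example.longlived", date(2018, 3, 1), None),
]
labels = [label_by_lifespan(a, observed) for a in apps]
```

Labels produced this way depend only on store listing history, which is the property the abstract highlights: no antivirus verdict enters the labeling step. A classifier would then be trained on static-analysis features against these labels.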

Highlights

Malware detection techniques are constantly evolving due to the necessity of detecting the presence of malware
We present a novel method of detection based on lifespan measurements that can be used for detecting malware in application markets
Even though the model trained with the XGB algorithm reaches 89% accuracy, the Random Forest Classification (RFC) model achieves 90% accuracy with a false positive rate of 5.43%
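For readers comparing the two reported metrics, the following sketch shows how accuracy and false positive rate are derived from confusion-matrix counts. The counts used here are made-up round numbers chosen for easy arithmetic; they are not the paper's results, and the function names are illustrative.

```python
def accuracy(tp: int, tn: int, fp: int, fn: int) -> float:
    """Fraction of all predictions that were correct."""
    total = tp + tn + fp + fn
    if total == 0:
        raise ValueError("no predictions to score")
    return (tp + tn) / total


def false_positive_rate(fp: int, tn: int) -> float:
    """Fraction of benign apps that were wrongly flagged as PHAs."""
    negatives = fp + tn
    if negatives == 0:
        raise ValueError("no benign samples to score")
    return fp / negatives


# Hypothetical counts, not taken from the paper.
tp, tn, fp, fn = 90, 90, 10, 10
acc = accuracy(tp, tn, fp, fn)
fpr = false_positive_rate(fp, tn)
```

The two metrics answer different questions: accuracy covers all predictions, while the false positive rate looks only at benign apps, which matters for a store filter that would block legitimate submissions.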

Summary

Introduction

Malware detection techniques are constantly evolving because of the need to detect the presence of malware. Cybercriminals constantly change their techniques, so novel detection methods need to be developed. According to Statcounter, Android has a market share greater than 72% [1]. Because of this popularity, the malware ecosystem has grown [2] [3]. All of this is related to the rise in smartphone users worldwide, more than 6 billion in 2021 [4]. Because the Google Play Store is the main distribution vector, novel techniques that control who publishes and which applications are published need to be developed. Some of them could have heavy policies against adware, while others tolerate this type of PHA.
