An Analysis of Android Malware Classification Services.

Mohammed Rashed, Guillermo Suarez-Tangil

doi:10.3390/s21165671

Abstract

The increasing number of Android malware samples has forced antivirus (AV) companies to rely on automated classification techniques to determine the family and class of suspicious samples. The research community relies heavily on such labels to carry out prevalence studies of the threat ecosystem and to build datasets that are used to validate and benchmark novel detection and classification methods. In this work, we carry out an extensive study of the Android malware ecosystem by surveying white papers and reports from 6 key players in the industry, as well as 81 papers from 8 top security conferences, to understand how malware datasets are used by both. We then explore the limitations associated with the use of available malware classification services, namely VirusTotal (VT) engines, for determining the family of an Android sample. Using a dataset of 2.47 M Android malware samples, we find that the detection coverage of VT’s AVs is generally very low, that the percentage of samples flagged by any 2 AV engines does not go beyond 52%, and that the share of common families between any pair of AV engines is at best 29%. We rely on clustering to determine the extent to which different AV engine pairs agree upon which samples belong to the same family (regardless of the actual family name) and find that there are discrepancies that can introduce noise in automatic label unification schemes. We also observe the usage of generic labels and inconsistencies within the labels of top AV engines, suggesting that their efforts are directed towards accurate detection rather than classification. Our results contribute to a better understanding of the limitations of using Android malware family labels as supplied by common AV engines.
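The pairwise measurements named in the abstract can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the engine names, the toy labels, the function names, and the exact metric definitions (the paper's own definitions are not given in the abstract). The name-agnostic agreement is approximated with a plain Rand index over sample pairs, which is a swapped-in stand-in for the clustering analysis the authors describe.

```python
from itertools import combinations

# Hypothetical toy data: engine -> {sample hash -> family label}.
# A sample missing from an engine's dict means the engine did not flag it.
labels = {
    "EngineA": {"s1": "fakeinst", "s2": "opfake", "s3": "droidkungfu", "s4": "opfake"},
    "EngineB": {"s1": "fakeinstaller", "s2": "opfake", "s4": "opfake"},
}
all_samples = {"s1", "s2", "s3", "s4", "s5"}


def co_detection(a, b, samples):
    """Fraction of all samples that both engines flagged."""
    return len(a.keys() & b.keys()) / len(samples)


def common_families(a, b):
    """Fraction of family names, over the union of both vocabularies,
    that both engines use (assumed definition)."""
    fa, fb = set(a.values()), set(b.values())
    union = fa | fb
    return len(fa & fb) / len(union) if union else 0.0


def rand_index(a, b):
    """Name-agnostic agreement: among samples labeled by both engines,
    the fraction of sample pairs grouped the same way by both
    (together in both, or apart in both)."""
    shared = sorted(a.keys() & b.keys())
    pairs = list(combinations(shared, 2))
    if not pairs:
        return float("nan")  # undefined with fewer than two shared samples
    agree = sum((a[x] == a[y]) == (b[x] == b[y]) for x, y in pairs)
    return agree / len(pairs)


a, b = labels["EngineA"], labels["EngineB"]
print(co_detection(a, b, all_samples))  # share of samples flagged by both
print(common_families(a, b))            # share of family names in common
print(rand_index(a, b))                 # grouping agreement, ignoring names
```

In this toy data the two engines share only one family name out of four, yet they group the shared samples identically. That gap is the reason agreement is measured independently of the family names.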

Highlights

With more than 2.8B active users worldwide, Android is the most used OS on mobile devices [1].
We first analyze the fraction of samples that have been detected as malware by at least one engine and are given a non-empty label.
We explored the usage of malware classification in both communities, industry and academia.
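The coverage measure in the second highlight can be sketched as follows. The report structure, engine names, and labels are hypothetical; real VirusTotal reports have a different and richer layout, so this only shows the counting logic under that assumption.

```python
# Hypothetical scan reports: sample hash -> {engine -> label, or None if not flagged}.
reports = {
    "s1": {"EngineA": "Android.Trojan.FakeInst", "EngineB": None},
    "s2": {"EngineA": None, "EngineB": ""},  # flagged, but with an empty label
    "s3": {"EngineA": None, "EngineB": None},
    "s4": {"EngineA": "Adware.Airpush", "EngineB": "Airpush.A"},
}


def has_label(report):
    """True if at least one engine returned a non-empty label."""
    return any(label is not None and label.strip() for label in report.values())


def labeled_fraction(reports):
    """Fraction of samples with a non-empty label from at least one engine."""
    if not reports:
        return 0.0
    return sum(has_label(r) for r in reports.values()) / len(reports)


print(labeled_fraction(reports))
```

Only s1 and s4 count here, since s2 carries an empty label and s3 was not flagged at all.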

Summary

Introduction

With more than 2.8B active users worldwide, Android is the most used OS on mobile devices [1]. Because the number of detected malware samples was limited early on, human analysts were able to study samples, identify their behavior, and label them following an internal scheme of the AV company, most likely including the platform, type, and family of the sample (see Section 5.2). The later surge in samples made it inevitable for AV companies to automate both detection and family classification, because the influx of incoming samples could no longer be handled manually [7].



Journal: Sensors
Publication Date: Aug 23, 2021
Citations: 2
License type: CC BY 4.0
