Towards more realistic evaluations: The impact of label delays in malware detection pipelines

Marcus Botacin,Heitor Gomes

doi:10.1016/j.cose.2024.104122

Abstract

Developing and evaluating malware classification pipelines to reflect real-world needs is as vital to protect users as it is hard to achieve. In many cases, the experimental conditions when the approach was developed and the deployment settings mismatch, which causes the solutions not to achieve the desired results. In this work, we explore how unrealistic project and evaluation decisions in the literature are. In particular, we shed light on the problem of label delays, i.e., the assumption that ground-truth labels for classifier retraining are always available when in the real world they take significant time to be produced, which also causes a significant attack opportunity window. In our analyses, among diverse aspects, we address: (1) The use of metrics that do not account for the effect of time; (2) The occurrence of concept drift and ideal assumptions about the amount of drift data a system can handle; and (3) Ideal assumptions about the availability of oracle data for drift detection and the need for relying on pseudo-labels for mitigating drift-related delays. We present experiments based on a newly proposed exposure metric to show that delayed labels due to limited analysis queue sizes impose a significant challenge for detection (e.g., up to a 75% greater attack opportunity in the real world than in the experimental setting) and that pseudo-labels are useful in mitigating the delays (reducing the detection loss to only 30% of the original value).

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Towards more realistic evaluations: The impact of label delays in malware detection pipelines

Abstract

Talk to us

Similar Papers

More From: Computers & Security

Lead the way for us

Similar Papers

Sources of bias in observational studies of covid-19 vaccine effectiveness.
Kaiser Fung ... Mark Jones
Journal of Evaluation in Clinical Practice | VOL. 30
Kaiser Fung, et. al.Kaiser Fung ... Mark Jones
26 Mar 2023
Journal of Evaluation in Clinical Practice | VOL. 30

Alternative Paths Toward a Low Carbon World
Valentina Bosetti ... Massimo Tavoni
SSRN Electronic Journal | VOL. -
Valentina Bosetti, et. al.Valentina Bosetti ... Massimo Tavoni
16 Jun 2010
SSRN Electronic Journal | VOL. -

Alternative Paths toward a Low Carbon World
...
-
, et. al. ...
01 Jun 2010
01 Jun 2010

A Robust Service Selection Method Based on Uncertain QoS
Yanping Chen ... Lu Jiang
Mathematical Problems in Engineering | VOL. 2016
Yanping Chen, et. al.Yanping Chen ... Lu Jiang
01 Jan 2015
Mathematical Problems in Engineering | VOL. 2016

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Towards more realistic evaluations: The impact of label delays in malware detection pipelines

Abstract

Talk to us

Similar Papers

More From: Computers & Security