Early-stage malware prediction using recurrent neural networks

Matilda Rhode,Pete Burnap,Kevin Jones

doi:10.1016/j.cose.2018.05.010

Matilda Rhode, Pete Burnap + Show 1 more

Open Access

https://doi.org/10.1016/j.cose.2018.05.010

Copy DOI

Abstract

Static malware analysis is well-suited to endpoint anti-virus systems as it can be conducted quickly by examining the features of an executable piece of code and matching it to previously observed malicious code. However, static code analysis can be vulnerable to code obfuscation techniques. Behavioural data collected during file execution is more difficult to obfuscate, but takes a relatively long time to capture - typically up to 5 min, meaning the malicious payload has likely already been delivered by the time it is detected.In this paper we investigate the possibility of predicting whether or not an executable is malicious based on a short snapshot of behavioural data. We find that an ensemble of recurrent neural networks are able to predict whether an executable is malicious or benign within the first 5 s of execution with 94% accuracy. This is the first time general types of malicious file have been predicted to be malicious during execution rather than using a complete activity log file post-execution, and enables cyber security endpoint protection to be advanced to use behavioural data for blocking malicious payloads rather than detecting them post-execution and having to repair the damage.

Highlights

Automatic malware detection is necessary to process the rapidly rising rate and volume of new malware being generated
The main contributions of this paper are: 1. We propose a recurrent neural network (RNN) model to predict malicious behaviour using machine activity data and demonstrate its capabilities are superior to other machine learning solutions that have previously been used for malware detection
The code used to implement the following experiments can be found at https://github.com/mprhode/malware-prediction-rnn

Summary

Introduction

Automatic malware detection is necessary to process the rapidly rising rate and volume of new malware being generated. Automatic malware detection used in anti-virus systems compares (features extracted from) the code of an incoming file to a known list of malware signatures. This form of filtering using static data is unsuited to detecting completely new (“zero-day”). Behavioural analysis approaches assume that malware cannot avoid leaving a measurable footprint as a result of the actions necessary for it to achieve its aims. Executing the malware incurs a time penalty by comparison with static analysis. Whilst dynamic data can lead to more accurate and resilient detection models than static data ([4], [5], [6]), in practice behavioural data is rarely used in commercial endpoint anti-virus systems due to this time penalty. It is inconvenient and inefficient to wait for several minutes whilst a single file is analysed, and the malicious payload has likely been delivered by the end of the analysis window so the opportunity to block malicious actions has been missed

Objectives

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Computers & Security	Publication Date: May 22, 2018
Citations: 231	License type: cc-by

R Discovery Prime

R Discovery Prime

Early-stage malware prediction using recurrent neural networks

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Computers & Security

Lead the way for us

Similar Papers

Ensemble of Multi-time Resolution Recurrent Neural Networks for Enhanced Feature Extraction in High-Rate Time Series
Vahid Barzegar ... Chao Hu
-
Vahid Barzegar, et. al.Vahid Barzegar ... Chao Hu
10 May 2021
10 May 2021

English
Viktor Buyankin ... S Kovaleva
-
Viktor Buyankin, et. al.Viktor Buyankin ... S Kovaleva
06 Mar 2013
06 Mar 2013

Ensemble of recurrent neural networks with long short-term memory cells for high-rate structural health monitoring
Vahid Barzegar ... Jacob Dodson
Mechanical Systems and Signal Processing | VOL. 164
Vahid Barzegar, et. al.Vahid Barzegar ... Jacob Dodson
19 Jul 2021
Mechanical Systems and Signal Processing | VOL. 164

Audio bandwidth extension using ensemble of recurrent neural networks
Xin Liu ... Chang-Chun Bao
EURASIP Journal on Audio, Speech, and Music Processing | VOL. 2016
Xin Liu, et. al.Xin Liu ... Chang-Chun Bao
12 May 2016
EURASIP Journal on Audio, Speech, and Music Processing | VOL. 2016

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Early-stage malware prediction using recurrent neural networks

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Computers &amp; Security

More From: Computers & Security