Malware refers to software that is designed to achieve a malicious purpose, usually to benefit its creator. To accomplish this, malware hides its true purpose from its target and from malware analysts until it has established a foothold on the victim’s machine. Malware analysts therefore have to find increasingly sophisticated detection methods, prompting malware authors to equip their malware with ever more evasive techniques. Dynamic malware analysis has been framed as a potential solution, since it runs malware in its preferred environment so that its true behaviour can be observed. In practice, however, the analysis environment is usually a restricted form of that preferred environment, and samples may be run for only two minutes or less. If malware does not demonstrate its malicious intent within that time frame and environment, the behaviour observed and subsequently learned may not be the behaviour that needs to be prevented. There is a risk that classifiers trained using the standard dynamic malware analysis process will recognise malware only by its evasive behaviour rather than by a mix of behaviours. In this paper, we study the extent to which classifiers depend on evasive behaviour when identifying malware. We achieve this by training them on real ransomware and benignware and then testing their ability to detect carefully crafted simulated ransomware. The simulated ransomware gives us the freedom to create samples with different levels of evasive and malicious behaviour. The simulated samples, like the real samples, are run in a sandboxed environment where data is collected at both user and kernel level. The results of our experiments indicate that, in general, the classifiers were more likely to label the simulated samples as malicious once the amount of evasive behaviour in a sample went beyond a threshold. Generally, this threshold was crossed when the simulated ransomware waited 2 s or more between each file it encrypted. Additionally, the classifiers trained on user-level data were less robust to small changes in the system calls made, whereas those trained on system calls gathered at a kernel, system-wide level produced less variable results. Finally, in attempting to simulate malware for our experiments, we discovered that the field of malware simulation is relatively unstudied despite its potential, and we therefore provide recommendations for simulating malware for system-call analysis.
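To make the varying-evasiveness setup concrete, the following is a minimal, hedged sketch of the kind of simulated-ransomware loop described above, with a configurable delay between file encryptions acting as the evasiveness knob. The file names, the toy XOR "cipher", and the delay_seconds parameter are illustrative assumptions, not the paper's actual implementation; the sketch only touches dummy files it creates in its own temporary directory, so it is harmless to run.

    # Hypothetical sketch: simulated ransomware with a configurable pause
    # between file encryptions (the evasive-waiting behaviour discussed above).
    # It creates and "encrypts" its own dummy files and leaves them in place.
    import os
    import time
    import tempfile

    def xor_encrypt(data: bytes, key: bytes) -> bytes:
        """Toy stream 'encryption' (XOR with a repeating key); a stand-in for
        whatever cipher a real simulator would use."""
        return bytes(b ^ key[i % len(key)] for i, b in enumerate(data))

    def run_simulation(num_files: int = 5, delay_seconds: float = 2.0) -> None:
        """Create dummy files, then write an 'encrypted' copy of each one,
        sleeping delay_seconds between files. Varying delay_seconds models
        more or less evasive waiting between encryptions."""
        key = os.urandom(16)
        workdir = tempfile.mkdtemp(prefix="ransim_")

        # Populate the sandbox directory with dummy victim files.
        for i in range(num_files):
            with open(os.path.join(workdir, f"victim_{i}.txt"), "wb") as f:
                f.write(f"dummy document {i}\n".encode())

        # Encrypt each dummy file to a .enc copy, pausing between files.
        for name in sorted(os.listdir(workdir)):
            if name.endswith(".enc"):
                continue
            path = os.path.join(workdir, name)
            with open(path, "rb") as f:
                ciphertext = xor_encrypt(f.read(), key)
            with open(path + ".enc", "wb") as f:
                f.write(ciphertext)
            time.sleep(delay_seconds)  # evasive pause between encryptions

        print(f"Simulated run complete in {workdir} (delay={delay_seconds}s)")

    if __name__ == "__main__":
        # A 2-second delay corresponds to the threshold reported in the abstract.
        run_simulation(num_files=5, delay_seconds=2.0)

In an experiment along these lines, sweeping delay_seconds from 0 upwards while tracing the process's system calls would yield samples whose evasive behaviour varies while the malicious behaviour (file encryption) stays fixed.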