Abstract
The integration of large language models (LLMs) into cybersecurity pipelines has become increasingly prevalent, enabling the automation of numerous manual tasks and often surpassing human performance. Recognising this potential, cybersecurity researchers and practitioners are actively investigating the application of LLMs to process vast volumes of heterogeneous data for anomaly detection, potential bypass identification, attack mitigation, and fraud prevention. Moreover, the advanced capabilities of LLMs in generating functional code, interpreting code context, and summarising code present significant opportunities for reverse engineering and malware deobfuscation.

In this work, we comprehensively examine the deobfuscation capabilities of state-of-the-art LLMs. Specifically, we evaluate four prominent LLMs on real-world malicious scripts from the notorious Emotet malware campaign. Our findings reveal that, while current LLMs are not yet perfectly accurate, they demonstrate substantial potential for efficiently deobfuscating payloads. This study highlights the importance of fine-tuning LLMs for specialised tasks, suggesting that such optimisation could pave the way for future AI-powered threat intelligence pipelines to combat obfuscated malware. Our contributions include a thorough analysis of LLM performance in malware deobfuscation, an identification of its strengths and limitations, and a discussion of the potential for integrating LLMs into cybersecurity frameworks for enhanced threat detection and mitigation. Our experiments illustrate that LLMs can automatically extract the necessary indicators of compromise from a real-world campaign, achieving an accuracy of 69.56% for the droppers' URLs and 88.78% for the corresponding domains.