Actor Critic Deep Reinforcement Learning for Neural Malware Control

Yu Wang,Mady Marinescu,Jack Stokes

doi:10.1609/aaai.v34i01.5449

Abstract

In addition to using signatures, antimalware products also detect malicious attacks by evaluating unknown files in an emulated environment, i.e. sandbox, prior to execution on a computer's native operating system. During emulation, a file cannot be scanned indefinitely, and antimalware engines often set the number of instructions to be executed based on a set of heuristics. These heuristics only make the decision of when to halt emulation using partial information leading to the execution of the file for either too many or too few instructions. Also this method is vulnerable if the attackers learn this set of heuristics. Recent research uses a deep reinforcement learning (DRL) model employing a Deep Q-Network (DQN) to learn when to halt the emulation of a file. In this paper, we propose a new DRL-based system which instead employs a modified actor critic (AC) framework for the emulation halting task. This AC model dynamically predicts the best time to halt the file's execution based on a sequence of system API calls. Compared to the earlier models, the new model is capable of handling adversarial attacks by simulating their behaviors using the critic model. The new AC model demonstrates much better performance than both the DQN model and antimalware engine's heuristics. In terms of execution speed (evaluated by the halting decision), the new model halts the execution of unknown files by up to 2.5% earlier than the DQN model and 93.6% earlier than the heuristics. For the task of detecting malicious files, the proposed AC model increases the true positive rate by 9.9% from 69.5% to 76.4% at a false positive rate of 1% compared to the DQN model, and by 83.4% from 41.2% to 76.4% at a false positive rate of 1% compared to a recently proposed LSTM model.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Actor Critic Deep Reinforcement Learning for Neural Malware Control

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence

Lead the way for us

Journal: Proceedings of the AAAI Conference on Artificial Intelligence	Publication Date: Apr 3, 2020
Citations: 5

Similar Papers

Deep Reinforcement Learning for Automatic Drilling Optimization Using an Integrated Reward Function
Xu Huang ... Ted Furlong
-
Xu Huang, et. al.Xu Huang ... Ted Furlong
27 Feb 2024
27 Feb 2024

Explainable AI in Deep Reinforcement Learning Models: A SHAP Method Applied in Power System Emergency Control
Ke Zhang ... Peidong Xu
-
Ke Zhang, et. al.Ke Zhang ... Peidong Xu
30 Oct 2020
30 Oct 2020

High-Frequency Quantitative Trading of Digital Currencies Based on Fusion of Deep Reinforcement Learning Models with Evolutionary Strategies
Yijun He ... Bo Xu
Journal of Computing and Information Technology | VOL. 32
Yijun He, et. al.Yijun He ... Bo Xu
15 Jul 2024
Journal of Computing and Information Technology | VOL. 32

Watermarks for Deep Reinforcement Learning
Kangjie Chen
-
Kangjie ChenKangjie Chen
28 Nov 2022
28 Nov 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Actor Critic Deep Reinforcement Learning for Neural Malware Control

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence