A LSTM based framework for handling multiclass imbalance in DGA botnet detection

Duc Tran, Hai Anh Tran, Van Tong, Hieu Mac, Linh Giang Nguyen

doi:10.1016/j.neucom.2017.11.018

Abstract

In recent years, botnets have become a major threat on the Internet. Most sophisticated bots use Domain Generation Algorithms (DGA) to pseudo-randomly generate a large number of domains and select a subset of them in order to communicate with the Command and Control (C&C) server. The basic aim is to avoid blacklisting and sinkholing and to evade security systems. The Long Short-Term Memory network (LSTM) provides a means to combat this type of botnet. It operates on raw domains and is amenable to immediate application. LSTM is, however, prone to the multiclass imbalance problem, which becomes even more significant in DGA malware detection. This is due to the fact that many DGA classes have very little support in the training dataset. This paper presents a novel LSTM.MI algorithm that combines binary and multiclass classification models, where the original LSTM is adapted to be cost-sensitive. Cost items are introduced into the backpropagation learning procedure to take into account the identification importance among classes. Experiments are carried out on a real-world collected dataset. They demonstrate that LSTM.MI provides an improvement of at least 7% in terms of macro-averaging recall and precision as compared to the original LSTM and other state-of-the-art cost-sensitive methods. It is also able to preserve the high accuracy on the non-DGA generated class (0.9849 F1-score), while helping recognize 5 additional bot families.
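
The two ideas the abstract relies on, a cost-sensitive loss and macro-averaged metrics, can be illustrated with a minimal Python sketch. This is not the LSTM.MI implementation: the abstract does not give the paper's cost formula, so the cost function below (largest class size divided by class size, raised to a hypothetical exponent `gamma`) is an assumption, and the function names and the `benign` / `dga_a` labels are invented for illustration.

```python
import math
from collections import Counter


def class_costs(labels, gamma=0.5):
    """Assumed cost per class: (largest class size / class size) ** gamma.

    Rare classes get a cost above 1, the largest class gets exactly 1.
    The paper's actual cost items may be defined differently.
    """
    counts = Counter(labels)
    largest = max(counts.values())
    return {cls: (largest / n) ** gamma for cls, n in counts.items()}


def cost_sensitive_cross_entropy(probs, true_label, costs):
    """Cross-entropy for one sample, scaled by the cost of its true class."""
    return -costs[true_label] * math.log(probs[true_label])


def macro_recall_precision(y_true, y_pred):
    """Unweighted mean of per-class recall and precision."""
    classes = sorted(set(y_true))
    recalls, precisions = [], []
    for cls in classes:
        tp = sum(1 for t, p in zip(y_true, y_pred) if t == cls and p == cls)
        actual = sum(1 for t in y_true if t == cls)
        predicted = sum(1 for p in y_pred if p == cls)
        recalls.append(tp / actual)
        # A class that is never predicted contributes a precision of 0.
        precisions.append(tp / predicted if predicted else 0.0)
    return sum(recalls) / len(classes), sum(precisions) / len(classes)


# Toy data: 8 benign domains and 2 domains from one rare DGA family.
labels = ["benign"] * 8 + ["dga_a"] * 2
costs = class_costs(labels, gamma=1.0)

# The same prediction error is penalized more heavily on the rare class.
uncertain = {"benign": 0.5, "dga_a": 0.5}
loss_benign = cost_sensitive_cross_entropy(uncertain, "benign", costs)
loss_rare = cost_sensitive_cross_entropy(uncertain, "dga_a", costs)

# A classifier that always answers "benign" is right 8 times out of 10,
# yet its macro-averaged recall and precision are low.
macro_recall, macro_precision = macro_recall_precision(labels, ["benign"] * 10)
```

In this toy setup the rare class carries four times the cost of the majority class, so its errors contribute more to the gradient during backpropagation. The always-benign classifier illustrates why macro averaging is the natural metric under imbalance: every class counts equally, so ignoring a small DGA family is visible in the score even when overall accuracy stays high.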

Journal: Neurocomputing
Publication Date: Nov 16, 2017
Citations: 180

Similar Papers

DeepDGA-MINet: Cost-Sensitive Deep Learning Based Framework for Handling Multiclass Imbalanced DGA Detection
R Vinayakumar ... K P Soman
01 Jan 2020

WEB PAGE CLASSIFICATION WITH DEEP LEARNING METHODS
Mehmet Salih Kurt ... Eylem Yücel Demirel
Uludağ University Journal of The Faculty of Engineering
16 Mar 2022

Inline Detection of Domain Generation Algorithms with Context-Sensitive Word Embeddings
Joewie J Koh ... Barton Rhodes
21 Nov 2018

Detecting Unknown DGAs without Context Information
Arthur Drichel ... Ulrike Meyer
23 Aug 2022
