Algorithmically generated malicious domain names detection based on n-grams features

Alessandro Cucchiarelli, Christian Morbidoni, Luca Spalazzi, Marco Baldi

doi:10.1016/j.eswa.2020.114551

Abstract

Botnets are one of the major cyber infections used in several criminal activities. In most botnets, a Domain Generation Algorithm (DGA) is used by bots to make DNS queries aimed at establishing a connection with the Command and Control (C&C) server. Identifying such queries by monitoring network DNS traffic is therefore crucial for bot detection. In this paper we present a methodology to detect DGA-generated domain names based on a supervised machine learning process, trained on a dataset of known benign and malicious domain names. The proposed approach represents domain names through a set of features that express the similarity between the 2-grams and 3-grams in a single unclassified domain name and those in domain names known to be malicious or benign. We used the Kullback-Leibler divergence and the Jaccard index to estimate the similarity, and we tested different machine learning algorithms to classify each domain name as benign or DGA-based (using both binary and multi-class approaches). The results of our experiments demonstrate that the proposed methodology, which exploits only lexical features of domain names, attains a good level of accuracy and yields a general model able to classify previously unseen domains effectively. It is also able to outperform some of the state-of-the-art featureless classification methods based on deep learning.
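The feature idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not give the exact feature definitions, so the function names (`ngrams`, `jaccard_index`, `kl_divergence`, `features`), the smoothing constant `eps`, and the tiny reference lists are all assumptions made for illustration only.

```python
import math
from collections import Counter


def ngrams(name, n):
    """Return the list of character n-grams of a domain label."""
    return [name[i:i + n] for i in range(len(name) - n + 1)]


def jaccard_index(a, b):
    """Jaccard index |A & B| / |A | B| of two n-gram collections, taken as sets."""
    set_a, set_b = set(a), set(b)
    union = set_a | set_b
    return len(set_a & set_b) / len(union) if union else 0.0


def kl_divergence(p_counts, q_counts, eps=1e-9):
    """Kullback-Leibler divergence D(P || Q), summed over the n-grams seen in P.

    eps is an assumed smoothing floor so that n-grams absent from Q
    do not cause a division by zero; the paper may handle this differently.
    """
    p_total = sum(p_counts.values())
    q_total = sum(q_counts.values())
    if p_total == 0 or q_total == 0:
        return 0.0
    div = 0.0
    for gram, count in p_counts.items():
        p = count / p_total
        q = q_counts.get(gram, 0) / q_total
        div += p * math.log(p / max(q, eps))
    return div


def features(name, benign, malicious):
    """Build 2-gram and 3-gram similarity features against both reference sets."""
    feats = {}
    for n in (2, 3):
        grams = ngrams(name, n)
        for label, ref in (("benign", benign), ("dga", malicious)):
            ref_grams = [g for d in ref for g in ngrams(d, n)]
            feats[f"jaccard_{n}_{label}"] = jaccard_index(grams, ref_grams)
            feats[f"kl_{n}_{label}"] = kl_divergence(Counter(grams), Counter(ref_grams))
    return feats


# Hypothetical toy reference sets; a real system would use large labelled lists.
benign_names = ["google", "facebook"]
dga_names = ["xkqzvwpt", "qwzjxkvb"]
example = features("google", benign_names, dga_names)
```

The resulting dictionary holds eight numeric values per domain name, which could then be passed to any supervised classifier. Under these assumptions, a name whose n-grams overlap heavily with the benign set gets a high Jaccard score and a low divergence against it.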

Journal: Expert Systems with Applications
Publication Date: Dec 30, 2020
Citations: 35

Similar Papers

DeepDGA-MINet: Cost-Sensitive Deep Learning Based Framework for Handling Multiclass Imbalanced DGA Detection
R Vinayakumar ... K P Soman
01 Jan 2020

Improved DGA Domain Names Detection and Categorization Using Deep Learning Architectures with Classical Machine Learning Algorithms
R Vinayakumar ... S Akarsh
01 Jan 2019

Detecting Multielement Algorithmically Generated Domain Names Based on Adaptive Embedding Model
Luhui Yang ... Jiangtao Zhai
Security and Communication Networks | VOL. 2021
31 May 2021

Domain generation algorithms detection with feature extraction and Domain Center construction.
Xinjie Sun ... Zhifang Liu
PLOS ONE | VOL. 18
27 Jan 2023
