DGA domain embedding with deep metric learning

Yifan Yang,Lingbin Zeng,Bingnan Hou,Xionglve Li,Wenyuan Kuang,Zhiping Cai,Tao Yang

doi:10.1093/comjnl/bxae072

Abstract

Abstract Botnets currently use domain-generation algorithms to produce fast-flux domains that enable them to evade detection. Accurately categorizing these botnet domains is crucial to develop cybersecurity solutions against botnet threats. However, existing methods, requiring labeled data, are ineffective against new botnets. To address this issue, we propose Domain2Vec, a metric learning-based approach that can explore new botnets. Domain2Vec integrates a framework of metric learning, which uses individual domains from known botnets for categorization of unknown botnet domains. The training involves an attention-based encoder, and it includes a constraint to ensure that samples with the same labels are closer in the embedding space. The categorization uses the encoder to project domain names into appropriate representations (numerical vectors), even for domains from new botnets. Finally, Domain2Vec uses numerical vectors to explore botnets. Experiments showed that Domain2Vec performs well on domain retrieval and clustering tasks without labeled data, outperforming the state of the art by 13% and 100%, respectively. Real-world tests demonstrate that Domain2Vec can effectively identify unreported malicious domains and monitor botnet activities.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

DGA domain embedding with deep metric learning

Abstract

Talk to us

Similar Papers

More From: The Computer Journal

Lead the way for us

Similar Papers

Divide and Conquer the Embedding Space for Metric Learning
Artsiom Sanakoyeu ... Bjorn Ommer
-
Artsiom Sanakoyeu, et. al.Artsiom Sanakoyeu ... Bjorn Ommer
01 Jun 2019
01 Jun 2019

Improving Deep Metric Learning by Divide and Conquer.
Artsiom Sanakoyeu ... Vadim Tschernezki
IEEE Transactions on Pattern Analysis and Machine Intelligence | VOL. 44
Artsiom Sanakoyeu, et. al.Artsiom Sanakoyeu ... Vadim Tschernezki
01 Jan 2020
IEEE Transactions on Pattern Analysis and Machine Intelligence | VOL. 44

Autoencoder-based unsupervised clustering and hashing
Bolin Zhang ... Jiangbo Qian
Applied Intelligence | VOL. 51
Bolin Zhang, et. al.Bolin Zhang ... Jiangbo Qian
19 Aug 2020
Applied Intelligence | VOL. 51

Deep metric learning via group channel-wise ensemble
Ping Li ... Xianghua Xu
Knowledge-Based Systems | VOL. 259
Ping Li, et. al.Ping Li ... Xianghua Xu
19 Oct 2022
Knowledge-Based Systems | VOL. 259

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

DGA domain embedding with deep metric learning

Abstract

Talk to us

Similar Papers

More From: The Computer Journal