CNN–MHSA: A Convolutional Neural Network and multi-head self-attention combined approach for detecting phishing websites

Xi Xiao,Dianyan Zhang,Guangwu Hu,Yong Jiang,Shutao Xia

doi:10.1016/j.neunet.2020.02.013

Abstract

Increasing phishing sites today have posed great threats due to their terribly imperceptible hazard. They expect users to mistake them as legitimate ones so as to steal user information and properties without notice. The conventional way to mitigate such threats is to set up blacklists. However, it cannot detect one-time Uniform Resource Locators (URL) that have not appeared in the list. As an improvement, deep learning methods are applied to increase detection accuracy and reduce the misjudgment ratio. However, some of them only focus on the characters in URLs but ignore the relationships between characters, which results in that the detection accuracy still needs to be improved. Considering the multi-head self-attention (MHSA) can learn the inner structures of URLs, in this paper, we propose CNN–MHSA, a Convolutional Neural Network (CNN) and the MHSA combined approach for highly-precise. To achieve this goal, CNN–MHSA first takes a URL string as the input data and feeds it into a mature CNN model so as to extract its features. In the meanwhile, MHSA is applied to exploit characters’ relationships in the URL so as to calculate the corresponding weights for the CNN learned features. Finally, CNN–MHSA can produce highly-precise detection result for a URL object by integrating its features and their weights. The thorough experiments on a dataset collected in real environment demonstrate that our method achieves 99.84% accuracy, which outperforms the classical method CNN–LSTM and at least 6.25% higher than other similar methods on average.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

CNN–MHSA: A Convolutional Neural Network and multi-head self-attention combined approach for detecting phishing websites

Abstract

Talk to us

Similar Papers

More From: Neural Networks

Lead the way for us

Journal: Neural Networks	Publication Date: Feb 29, 2020
Citations: 74

Similar Papers

A Joint Approach to Detect Malicious URL Based on Attention Mechanism
Yongfang Peng ... Shengwei Tian
International Journal of Computational Intelligence and Applications | VOL. 18
Yongfang Peng, et. al.Yongfang Peng ... Shengwei Tian
01 Sep 2019
International Journal of Computational Intelligence and Applications | VOL. 18

Phishing websites detection via CNN and multi-head self-attention on imbalanced datasets
Xi Xiao ... Shutao Xia
Computers & Security | VOL. 108
Xi Xiao, et. al.Xi Xiao ... Shutao Xia
16 Jun 2021
Computers & Security | VOL. 108

Detecting phishing websites through improving convolutional neural networks with Self-Attention mechanism
Yahia Said ... Tawfeeq Shawly
Ain Shams Engineering Journal | VOL. 15
Yahia Said, et. al.Yahia Said ... Tawfeeq Shawly
22 Jan 2024
Ain Shams Engineering Journal | VOL. 15

A hybrid deep learning technique for spoofing website URL detection in real-time applications
Bridget C Ujah-Ogbuagu ... Emeka Ogbuju
Journal of Electrical Systems and Information Technology | VOL. 11
Bridget C Ujah-Ogbuagu, et. al.Bridget C Ujah-Ogbuagu ... Emeka Ogbuju
24 Jan 2024
Journal of Electrical Systems and Information Technology | VOL. 11

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

CNN–MHSA: A Convolutional Neural Network and multi-head self-attention combined approach for detecting phishing websites

Abstract

Talk to us

Similar Papers

More From: Neural Networks