Phishing URL detection with neural networks: an empirical study

Hayk Ghalechyan,Elina Israyelyan,Avag Arakelyan,Gerasim Hovhannisyan,Arman Davtyan

doi:10.1038/s41598-024-74725-6

Abstract

Cybercriminals create phishing websites that mimic legitimate websites to get sensitive information from companies, individuals, or governments. Therefore, using state-of-the-art artificial intelligence and machine learning technologies to correctly classify phishing and legitimate URLs is imperative. We report the results of applying deterministic and probabilistic neural network models to URL classification. Key achievements of this work are: (1) The development of a unique approach based on probabilistic neural networks that improves classification accuracy. (2) We show for the first time in URL phishing research that a machine learning model trained on a combination of open source and private datasets is successful in production. The dataset is constructed from open sources like Alexa, PhishTank, or OpenPhish and, most importantly, real-world production data from EasyDMARC. The daily validation of the model using daily reported URL data and corresponding labels, both from open-source platforms and private production, reach on average a 97% accuracy on the validation dataset, labeled by PhishTank, OpenPhish and EasdDMARC where possible mislabeled data can not be excluded and was not possible to check due to large number of URLs. Feature engineering was done without third-party dependencies. Lastly, the evaluation of both deterministic and probabilistic models shows high accuracy on short and long URLs, where short URLs are defined as having less than 50 characters.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Phishing URL detection with neural networks: an empirical study

Abstract

Talk to us

Similar Papers

More From: Scientific Reports

Lead the way for us

Journal: Scientific Reports	Publication Date: Oct 24, 2024
License type: cc-by-nc-nd

Similar Papers

Generative adversarial network-based phishing URL detection with variational autoencoder and transformer
Jishnu Kaitholikkal Sasi ... Arthi Balakrishnan
IAES International Journal of Artificial Intelligence (IJ-AI) | VOL. 13
Jishnu Kaitholikkal Sasi, et. al.Jishnu Kaitholikkal Sasi ... Arthi Balakrishnan
01 Jun 2024
IAES International Journal of Artificial Intelligence (IJ-AI) | VOL. 13

Phishing Websites Spotting with Help of using Machine Learning Tools
Prasanth Baskaran ... Chandru G
International Journal for Research in Applied Science and Engineering Technology | VOL. 11
Prasanth Baskaran, et. al.Prasanth Baskaran ... Chandru G
31 May 2023
International Journal for Research in Applied Science and Engineering Technology | VOL. 11

Phishing URL Detection with Lexical Features and Blacklisted Domains
Jiwon Hong ... Taeri Kim
-
Jiwon Hong, et. al.Jiwon Hong ... Taeri Kim
01 Jan 2020
01 Jan 2020

A Comparative Analysis of dot NET-Based and Open Source Platforms for Ontologies Development
Leki Jovial Mahoro ... Jean Vincent Fonou-Dombeu
-
Leki Jovial Mahoro, et. al.Leki Jovial Mahoro ... Jean Vincent Fonou-Dombeu
01 Aug 2020
01 Aug 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Phishing URL detection with neural networks: an empirical study

Abstract

Talk to us

Similar Papers

More From: Scientific Reports