URL filtering using machine learning algorithms

Asia Othman Aljahdalic,Shoroq Banafee,Thana Aljohani

doi:10.1080/19393555.2023.2193350

Abstract

ABSTRACT Cyber-attacks using malicious uniform resource locator (URL) propagation are very common and serious. Statistics indicate that there is a need to research and apply techniques and methods for identifying and preventing malicious URLs. The main objective of this research is to train machine learning models on selected dataset to predict phishing websites based on URL-related features. The accuracy level of each model is measured and compared. Finally, the best performing model will be used to develop a web application that provide internet users with an easy way to check suspicious URLs. We have used five different machine learning models to classify URLs as legitimate or phishing, these models are eXtreme Gradient Boosting (XGBoost), k-nearest neighbors (KNN), support vector machine (SVM), Decision Tree, and Random Forest. Finally, we used Voting Classifier to combine the work of Random Forest (RF) algorithm with other two models, Gaussian Naive Bayes, and Logistic Regression, to check if we can increase the accuracy of RF as suggested in the literature, but we found that the accuracy of RF alone was higher than the accuracy of the combined models. This project can be implemented as a browser extension or mobile application to classify suspicious URLs to legitimate or phishing with the use of the saved model.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

URL filtering using machine learning algorithms

Abstract

Talk to us

Similar Papers

More From: Information Security Journal: A Global Perspective

Lead the way for us

Similar Papers

You Look Suspicious!!: Leveraging Visible Attributes to Classify Malicious Short URLs on Twitter
Raj Kumar Nepali ... Yong Wang
-
Raj Kumar Nepali, et. al.Raj Kumar Nepali ... Yong Wang
01 Jan 2015
01 Jan 2015

Exploring Efficiency of GAN-based Generated URLs for Phishing URL Detection
Tuan Dung Pham ... Sy Tuong Hoang
-
Tuan Dung Pham, et. al.Tuan Dung Pham ... Sy Tuong Hoang
01 Oct 2021
01 Oct 2021

SQL-IDS
Naghmeh Moradpoor Sheykhkanloo
-
Naghmeh Moradpoor SheykhkanlooNaghmeh Moradpoor Sheykhkanloo
08 Sep 2015
08 Sep 2015

Employing Neural Networks for the Detection of SQL Injection Attack
Naghmeh Moradpoor Sheykhkanloo
-
Naghmeh Moradpoor SheykhkanlooNaghmeh Moradpoor Sheykhkanloo
09 Sep 2014
09 Sep 2014

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

URL filtering using machine learning algorithms

Abstract

Talk to us

Similar Papers

More From: Information Security Journal: A Global Perspective