Detection of phishing websites using a novel twofold ensemble model

Kalyan Nagaraj,Sharvani Gs,Biplab Bhattacharjee,Amulyashree Sridhar

doi:10.1108/jsit-09-2017-0074

Abstract

PurposePhishing is one of the major threats affecting businesses worldwide in current times. Organizations and customers face the hazards arising out of phishing attacks because of anonymous access to vulnerable details. Such attacks often result in substantial financial losses. Thus, there is a need for effective intrusion detection techniques to identify and possibly nullify the effects of phishing. Classifying phishing and non-phishing web content is a critical task in information security protocols, and full-proof mechanisms have yet to be implemented in practice. The purpose of the current study is to present an ensemble machine learning model for classifying phishing websites.Design/methodology/approachA publicly available data set comprising 10,068 instances of phishing and legitimate websites was used to build the classifier model. Feature extraction was performed by deploying a group of methods, and relevant features extracted were used for building the model. A twofold ensemble learner was developed by integrating results from random forest (RF) classifier, fed into a feedforward neural network (NN). Performance of the ensemble classifier was validated using k-fold cross-validation. The twofold ensemble learner was implemented as a user-friendly, interactive decision support system for classifying websites as phishing or legitimate ones.FindingsExperimental simulations were performed to access and compare the performance of the ensemble classifiers. The statistical tests estimated that RF_NN model gave superior performance with an accuracy of 93.41 per cent and minimal mean squared error of 0.000026.Research limitations/implicationsThe research data set used in this study is publically available and easy to analyze. Comparative analysis with other real-time data sets of recent origin must be performed to ensure generalization of the model against various security breaches. Different variants of phishing threats must be detected rather than focusing particularly toward phishing website detection.Originality/valueThe twofold ensemble model is not applied for classification of phishing websites in any previous studies as per the knowledge of authors.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Detection of phishing websites using a novel twofold ensemble model

Abstract

Talk to us

Similar Papers

More From: Journal of Systems and Information Technology

Lead the way for us

Journal: Journal of Systems and Information Technology	Publication Date: Oct 18, 2018
Citations: 14

Similar Papers

Comparative analysis of thermal preference prediction performance in different conditions using ensemble learning models based on ASHRAE Comfort Database II
Yan Bai ... Yuying Wang
Building and Environment | VOL. 223
Yan Bai, et. al.Yan Bai ... Yuying Wang
01 Sep 2022
Building and Environment | VOL. 223

Trainable segmentation for transmission electron microscope images of inorganic nanoparticles.
Cameron G Bell ... Kevin P Treder
Journal of Microscopy | VOL. 288
Cameron G Bell, et. al.Cameron G Bell ... Kevin P Treder
11 May 2022
Journal of Microscopy | VOL. 288

Investigating machine learning and ensemble learning models in groundwater potential mapping in arid region: case study from Tan-Tan water-scarce region, Morocco
Abdessamad Jari ... Mustapha Namous
Frontiers in Water | VOL. 5
Abdessamad Jari, et. al.Abdessamad Jari ... Mustapha Namous
13 Dec 2023
Frontiers in Water | VOL. 5

A Hybrid Images Deep Trained Feature Extraction and Ensemble Learning Models for Classification of Multi Disease in Fundus Images
Jyoti Verma ... Daljeet Singh
-
Jyoti Verma, et. al.Jyoti Verma ... Daljeet Singh
01 Jan 2024
01 Jan 2024

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Detection of phishing websites using a novel twofold ensemble model

Abstract

Talk to us

Similar Papers

More From: Journal of Systems and Information Technology