Phishing Detection Approach Research Articles

Phishing attacks are evolving with more sophisticated techniques, posing significant threats. Considering the potential of machine-learning-based approaches, our research presents a similar modern approach for web phishing detection by applying powerful machine learning algorithms. An efficient layered classification model is proposed to detect websites based on their URL structure, text, and image features. Previously, similar studies have used machine learning techniques for URL features with a limited dataset. In our research, we have used a large dataset of 20,000 website URLs, and 22 salient features from each URL are extracted to prepare a comprehensive dataset. Along with this, another dataset containing website text is also prepared for NLP-based text evaluation. It is seen that many phishing websites contain text as images, and to handle this, the text from images is extracted to classify it as spam or legitimate. The experimental evaluation demonstrated efficient and accurate phishing detection. Our layered classification model uses support vector machine (SVM), XGBoost, random forest, multilayer perceptron, linear regression, decision tree, naïve Bayes, and SVC algorithms. The performance evaluation revealed that the XGBoost algorithm outperformed other applied models with maximum accuracy and precision of 94% in the training phase and 91% in the testing phase. Multilayer perceptron also worked well with an accuracy of 91% in the testing phase. The accuracy results for random forest and decision tree were 91% and 90%, respectively. Logistic regression and SVM algorithms were used in the text-based classification, and the accuracy was found to be 87% and 88%, respectively. With these precision values, the models classified phishing and legitimate websites very well, based on URL, text, and image features. This research contributes to early detection of sophisticated phishing attacks, enhancing internet user security.

Read full abstract

Phishing attacks aim to steal confidential information using sophisticated methods, techniques, and tools such as phishing through content injection, social engineering, online social networks, and mobile applications. To avoid and mitigate the risks of these attacks, several phishing detection approaches were developed, among which deep learning algorithms provided promising results. However, the results and the corresponding lessons learned are fragmented over many different studies and there is a lack of a systematic overview of the use of deep learning algorithms in phishing detection. Hence, we performed a systematic literature review (SLR) to identify, assess, and synthesize the results on deep learning approaches for phishing detection as reported by the selected scientific publications. We address nine research questions and provide an overview of how deep learning algorithms have been used for phishing detection from several aspects. In total, 43 journal articles were selected from electronic databases to derive the answers for the defined research questions. Our SLR study shows that except for one study, all the provided models applied supervised deep learning algorithms. The widely used data sources were URL-related data, third party information on the website, website content-related data, and email. The most used deep learning algorithms were deep neural networks (DNN), convolutional neural networks, and recurrent neural networks/long short-term memory networks. DNN and hybrid deep learning algorithms provided the best performance among other deep learning-based algorithms. 72% of the studies did not apply any feature selection algorithm to build the prediction model. PhishTank was the most used dataset among other datasets. While Keras and Tensorflow were the most preferred deep learning frameworks, 46% of the articles did not mention any framework. This study also highlights several challenges for phishing detection to pave the way for further research.

Read full abstract

Phishing Detection Approach Research Articles

Related Topics

Articles published on Phishing Detection Approach

Illegitimate Websites Detection Using Deep Learning Framework

Phishing URL Detection: A Basic Machine Learning Approach

Hybrid optimization enabled squeeze net for phishing attack detection

A Review on Online Phishing Detection Using Machine Learning

A Hybrid Approach for Alluring Ads Phishing Attack Detection Using Machine Learning.

Look before you leap: Detecting phishing web pages by exploiting raw URL and HTML characteristics

An enhanced deep learning‐based phishing detection mechanism to effectively identify malicious URLs using variational autoencoders

Business Email Compromise Phishing Detection Based on Machine Learning: A Systematic Literature Review

Applications of deep learning for phishing detection: a systematic literature review.

Deep Learning Based Phishing Websites Detection

Modeling Hybrid Feature-Based Phishing Websites Detection Using Machine Learning Techniques

HinPhish: An Effective Phishing Detection Approach Based on Heterogeneous Information Networks

An adaptive approach for internet phishing detection based on log data

A novel approach for phishing URLs detection using lexical based machine learning in a real-time environment

Phishing website detection based on effective machine learning approach

The Answer is in the Text: Multi-Stage Methods for Phishing Detection Based on Feature Engineering

Optimization of URL-Based Phishing Websites Detection through Genetic Algorithms

A Case-Based Reasoning Approach for Automatic Adaptation of Classifiers in Mobile Phishing Detection

Classifier Performance Evaluation of Phishing Detection Model on Optimal Number of Clusters

Phishing Website Detection Based on Multidimensional Features Driven by Deep Learning

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Phishing Detection Approach Research Articles

Related Topics

Articles published on Phishing Detection Approach

Illegitimate Websites Detection Using Deep Learning Framework

Phishing URL Detection: A Basic Machine Learning Approach

Hybrid optimization enabled squeeze net for phishing attack detection

A Review on Online Phishing Detection Using Machine Learning

A Hybrid Approach for Alluring Ads Phishing Attack Detection Using Machine Learning.

Look before you leap: Detecting phishing web pages by exploiting raw URL and HTML characteristics

An enhanced deep learning‐based phishing detection mechanism to effectively identify malicious URLs using variational autoencoders

Business Email Compromise Phishing Detection Based on Machine Learning: A Systematic Literature Review

Applications of deep learning for phishing detection: a systematic literature review.

Deep Learning Based Phishing Websites Detection

Modeling Hybrid Feature-Based Phishing Websites Detection Using Machine Learning Techniques

HinPhish: An Effective Phishing Detection Approach Based on Heterogeneous Information Networks

An adaptive approach for internet phishing detection based on log data

A novel approach for phishing URLs detection using lexical based machine learning in a real-time environment

Phishing website detection based on effective machine learning approach

The Answer is in the Text: Multi-Stage Methods for Phishing Detection Based on Feature Engineering

Optimization of URL-Based Phishing Websites Detection through Genetic Algorithms

A Case-Based Reasoning Approach for Automatic Adaptation of Classifiers in Mobile Phishing Detection

Classifier Performance Evaluation of Phishing Detection Model on Optimal Number of Clusters

Phishing Website Detection Based on Multidimensional Features Driven by Deep Learning