Semantic Based Greedy Levy Gradient Boosting Algorithm for Phishing Detection

R Sakunthala Jenni,S Shankar

doi:10.32604/csse.2022.019300

Abstract

The detection of phishing and legitimate websites is considered a great challenge for web service providers because the users of such websites are indistinguishable. Phishing websites also create traffic in the entire network. Another phishing issue is the broadening malware of the entire network, thus highlighting the demand for their detection while massive datasets (i.e., big data) are processed. Despite the application of boosting mechanisms in phishing detection, these methods are prone to significant errors in their output, specifically due to the combination of all website features in the training state. The upcoming big data system requires MapReduce, a popular parallel programming, to process massive datasets. To address these issues, a probabilistic latent semantic and greedy levy gradient boosting (PLS-GLGB) algorithm for website phishing detection using MapReduce is proposed. A feature selection-based model is provided using a probabilistic intersective latent semantic preprocessing model to minimize errors in website phishing detection. Here, the missing data in each URL are identified and discarded for further processing to ensure data quality. Subsequently, with the preprocessed features (URLs), feature vectors are updated by the greedy levy divergence gradient (model) that selects the optimal features in the URL and accurately detects the websites. Thus, greedy levy efficiently differentiates between phishing websites and legitimate websites. Experiments are conducted using one of the largest public corpora of a website phish tank dataset. Results show that the PLS-GLGB algorithm for website phishing detection outperforms state-of-the-art phishing detection methods. Significant amounts of phishing detection time and errors are also saved during the detection of website phishing.

Highlights

Web security is a materializing inclination in novel big data settings
With the preprocessed features (URLs), feature vectors are updated by the greedy levy divergence gradient that selects the optimal features in the URL and accurately detects the websites
Conventional methods focus on the utilization of neural network models to address phishing attacks

Summary

Introduction

Web security is a materializing inclination in novel big data settings. Web security is directed by utilizing different methods, such as privacy preservation techniques, hidden Markov models, and reasoning-based strategies. Web phishing is the current pertinent interest. Phishing refers to the process of mimicking an official website of banks and social networking sites.

Results

Discussion

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Computer Systems Science and Engineering	Publication Date: Jan 1, 2022
Citations: 1	License type: cc-by

R Discovery Prime

R Discovery Prime

Semantic Based Greedy Levy Gradient Boosting Algorithm for Phishing Detection

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Computer Systems Science and Engineering

Lead the way for us

Similar Papers

SURVEY AND ANALYSIS ON PHISHING DETECTION TECHNIQUES
K Sumathi
International Journal of Advanced Research in Computer Science | VOL. 9
K SumathiK Sumathi
20 Feb 2018
International Journal of Advanced Research in Computer Science | VOL. 9

A Deep Learning-Based Innovative Technique for Phishing Detection in Modern Security with Uniform Resource Locators.
Eman Abdullah Aldakheel ... Ghada Abdalaziz Gashgari
Sensors | VOL. 23
Eman Abdullah Aldakheel, et. al.Eman Abdullah Aldakheel ... Ghada Abdalaziz Gashgari
30 Apr 2023
Sensors | VOL. 23

Types of anti-phishing solutions for phishing attack
Siti Hawa Apandi ... Roslina Mohd Sidek
IOP Conference Series: Materials Science and Engineering | VOL. 769
Siti Hawa Apandi, et. al.Siti Hawa Apandi ... Roslina Mohd Sidek
01 Feb 2020
IOP Conference Series: Materials Science and Engineering | VOL. 769

Detection of Phishing Website using Machine Learning
Vaishnavi Bhoyar ... Dipali Gawali
International Journal of Advanced Research in Science, Communication and Technology | VOL. -
Vaishnavi Bhoyar, et. al. Vaishnavi Bhoyar ... Dipali Gawali
07 Jan 2024
International Journal of Advanced Research in Science, Communication and Technology | VOL. -

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Semantic Based Greedy Levy Gradient Boosting Algorithm for Phishing Detection

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Computer Systems Science and Engineering