Abstract

The constantly evolving phishing attacks have forced scientists to counter such an attack. Most phishing attacks come with dangerous links or URLs that can potentially lead to data or information leaking to third parties. Machine Learning has been proven great in many cases of both known and unknown types of analysis. Its ability to both give reasoning and “self-taught” in addition with easier duplication makes it great to counter phishing attacks. This paper compares different machine learning algorithms to detect whether a URL is a legitimate URL or a phishing URL with a certain feature using a Web page phishing detection dataset. The machine learning algorithms that were compared are Naive Bayes, K-Nearest Neighbor, Random Forest, Decision Tree, Support Vector Machine, and Logistic Regression. The models were trained using a phishing dataset that has been passed through a preprocessing and encoding layer. The model resulting accuracy, along with other evaluation metrics are recorded and compared to each other. The results are pretty close with the highest accuracy claimed by the Random Forest algorithm which is 98,04% out of 11429 URLs.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.