Unmasking phishers: ML for malicious certificate detection

Magnea Haraldsdóttir, Sajad Homayoun, Emil Lynge, Christan D Jensen

doi:10.1016/j.cie.2024.110652

Abstract

Phishing attacks increasingly use digital certificates to appear safe to users, and the frequency of such attacks has surged in recent years. For example, around 80% of phishing attacks in 2021 used digital certificates to appear legitimate. The most common detection methods today rely on users reporting suspicious websites to phishing repositories, where the reports are then confirmed. This process can be slow, leaving a phishing website live on the Internet until the report is confirmed. Newer methods that apply machine learning models to a website's digital certificate have been shown to be effective. This paper presents a system that combines certificate-related and domain-name-related features with machine learning methods to detect phishing websites. To develop the system, domain names were collected from PhishTank and Tranco, and certificates were retrieved from Censys. The domain-related features are partly engineered using a time-series-based deep learning model that produces a vector representation of the domain name. Classical machine learning classifiers are then trained on the features engineered from the certificate and the domain name, and compared. Enriching the feature set with the vector representation of the domain names improves the ability to distinguish suspicious certificates from benign ones: the F1-score rises from 0.77 for a feature set based solely on certificate-related features to 0.89 with the enriched feature set. A time-based evaluation yields a similar F1-score of 0.88, an improvement over existing approaches to feature engineering.
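The two evaluation ideas mentioned in the abstract, the F1-score and a time-based evaluation, can be illustrated with a minimal sketch. This is not the authors' implementation: the record fields (`issued`, `label`), the cutoff date, and the toy data are hypothetical and chosen only for illustration, and the abstract does not state how the paper's time-based split was actually defined.

```python
from datetime import date


def f1_score(y_true, y_pred, positive=1):
    """Harmonic mean of precision and recall for the positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    if tp == 0:
        # No true positives: precision and recall are 0 or undefined.
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def time_based_split(records, cutoff):
    """Train on records dated before the cutoff, test on the rest.

    Assumption: each record is a dict with a hypothetical 'issued' date.
    """
    train = [r for r in records if r["issued"] < cutoff]
    test = [r for r in records if r["issued"] >= cutoff]
    return train, test


# Toy records (fabricated for illustration only); label 1 = phishing.
records = [
    {"issued": date(2021, 1, 10), "label": 1},
    {"issued": date(2021, 3, 5), "label": 0},
    {"issued": date(2021, 6, 20), "label": 1},
    {"issued": date(2021, 9, 1), "label": 0},
]
train, test = time_based_split(records, cutoff=date(2021, 6, 1))
```

Splitting by date rather than at random keeps later certificates out of the training set, so the test score reflects how a classifier would behave on certificates it has not yet seen.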

Similar Papers

Deep Learning Framework for Cyber Threat Situational Awareness Based on Email and URL Data Analysis
R Vinayakumar ... K P Soman
01 Jan 2019

Explainable Heart Disease Prediction Using Ensemble-Quantum Machine Learning Approach
Ghada Abdulsalam ... Hadil Shaiba
Intelligent Automation & Soft Computing | VOL. 36
01 Jan 2023

Phishing website detection using URL-assisted brand name weighting system
Choon Lin Tan ... Kang Leng Chiew
01 Dec 2014

Improved DGA Domain Names Detection and Categorization Using Deep Learning Architectures with Classical Machine Learning Algorithms
R Vinayakumar ... S Akarsh
01 Jan 2019
