Phishing page detection via learning classifiers from page layout feature

Jian Mao, Tao Wei, Zhenkai Liang, Aili Li, Jingdong Bian, Shishi Zhu, Wenqian Tian

doi:10.1186/s13638-019-1361-0

Abstract

Web technology has become the cornerstone of a wide range of platforms, such as mobile services and smart Internet-of-things (IoT) systems. In such platforms, users' data are aggregated in a cloud-based platform, where web applications serve as a key interface to access and configure user data. Securing the web interface requires solutions that deal with threats from both technical vulnerabilities and social factors. Phishing is one of the most commonly exploited vectors in social engineering attacks. Attackers use web pages that visually mimic legitimate web sites, such as banking and government services, to collect users' sensitive information. Existing phishing defense mechanisms based on URLs or page contents are often evaded by attackers. Recent research has demonstrated that visual layout similarity can be used as a robust basis to detect phishing attacks. In particular, features extracted from CSS layout files can be used to measure page similarity. However, specifying how to measure page similarity from such features requires human expertise. In this paper, we aim to automate page-layout-based phishing detection using machine learning. We propose a learning-based aggregation analysis mechanism to decide page layout similarity, which is then used to detect phishing pages. We prototype our solution and evaluate four popular machine learning classifiers on their accuracy and on the factors affecting their results.

Highlights

All the classifiers achieve more than 93% accuracy and more than 84% F1, which demonstrates that our approach can detect phishing websites effectively
Compared with other approaches, our method is lightweight, as it takes only one class of features, Cascading Style Sheets (CSS) structure, as input to measure the similarity of web pages and detect phishing attacks
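Accuracy and F1 measure different things, so the two figures quoted above need not be close. Accuracy counts every correct decision, while F1 is the harmonic mean of precision and recall on the phishing class alone. The sketch below computes both from confusion-matrix counts. The counts are hypothetical values chosen for illustration only; they are not results from the paper.

```python
def accuracy_and_f1(tp: int, fp: int, tn: int, fn: int) -> tuple[float, float]:
    """Return (accuracy, F1) for the positive class from confusion-matrix counts."""
    total = tp + fp + tn + fn
    accuracy = (tp + tn) / total if total else 0.0
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    if precision + recall == 0:
        return accuracy, 0.0
    f1 = 2 * precision * recall / (precision + recall)
    return accuracy, f1


# Hypothetical counts (NOT from the paper): 100 phishing pages, 900 legitimate pages.
acc, f1 = accuracy_and_f1(tp=80, fp=10, tn=890, fn=20)
print(f"accuracy={acc:.3f}, F1={f1:.3f}")
```

With these illustrative counts, the many correctly classified legitimate pages raise accuracy, while F1 reflects only how well the phishing class is handled.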

Summary

Introduction

Web technology has become the cornerstone of a wide range of platforms, such as mobile services and smart Internet-of-things (IoT) systems. Features extracted from CSS layout files are used to measure page similarity. These measurements rely heavily on human experience and may not be comprehensive enough to detect new attacks. In our previous work [8, 9], we demonstrated that CSS-based page layout features can serve as the basis for detecting phishing pages, where we convert CSS into a normalized representation called an influence vector. The basic unit of CSS is the rule, which consists of two parts: a selector and one or more declarations. Selectors can be classified into four categories: tag, ID, class, and others.
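To make the rule structure and the four selector categories concrete, here is a minimal sketch in Python. It is not the authors' implementation: the function names `parse_rule` and `classify_selector` are hypothetical, the parser handles only a single flat rule, and treating every compound or pseudo-class selector as "other" is an assumption made for illustration.

```python
import re


def parse_rule(rule: str) -> tuple[list[str], dict[str, str]]:
    """Split one flat CSS rule 'selectors { prop: value; ... }' into its two parts."""
    selector_part, _, body = rule.partition("{")
    body = body.rsplit("}", 1)[0]  # drop the closing brace if present
    declarations = {}
    for decl in body.split(";"):
        if ":" in decl:
            prop, value = decl.split(":", 1)
            declarations[prop.strip()] = value.strip()
    selectors = [s.strip() for s in selector_part.split(",") if s.strip()]
    return selectors, declarations


def classify_selector(selector: str) -> str:
    """Classify a selector as 'tag', 'id', 'class', or 'other' (assumed scheme)."""
    s = selector.strip()
    if re.fullmatch(r"#[A-Za-z_][\w-]*", s):
        return "id"
    if re.fullmatch(r"\.[A-Za-z_][\w-]*", s):
        return "class"
    if re.fullmatch(r"[A-Za-z][A-Za-z0-9]*", s):
        return "tag"
    return "other"  # compound, attribute, pseudo-class selectors, etc.


selectors, declarations = parse_rule(
    "#header, .nav, div, a:hover { color: red; margin: 0 auto; }"
)
for sel in selectors:
    print(sel, "->", classify_selector(sel))
print(declarations)
```

A production parser would need to handle comments, nested at-rules, and strings containing braces or semicolons; a dedicated CSS parsing library would be a safer choice there.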

Journal: EURASIP Journal on Wireless Communications and Networking
Publication Date: Feb 20, 2019
Citations: 42
License type: open-access
