Email phishing: text classification using natural language processing

Priyanka Verma,Anjali Goyal,Yogita Gigras

doi:10.11591/csit.v1i1.p1-12

Priyanka Verma, Anjali Goyal + Show 1 more

Open Access

https://doi.org/10.11591/csit.v1i1.p1-12

Copy DOI

Abstract

Phishing is networked theft in which the main motive of phishers is to steal any person’s private information, its financial details like account number, credit card details, login information, payment mode information by creating and developing a fake page or a fake web site, which look completely authentic and genuine. Nowadays email phishing has become a big threat to all, and is increasing day by day. Moreover detection of phishing emails have been considered an important research issue as phishing emails have been increasing day by day. Various techniques have been introduced and applied to deal with such a big issue. The major objective of this research paper is giving a detailed description on the classification of phishing emails using the natural language processing concepts. NLP (natural language processing) concepts have been applied for the classification of emails, along with that accuracy rate of various classifiers have been calculated. The paper is presented in four sections. An introduction about phishing its types, its history, statistics, life cycle, motivation for phishers and working of email phishing have been discussed in the first section. The second section covers various technologies of phishing- email phishing and also description of evaluation metrics. An overview of the various proposed solutions and work done by researchers in this field in form of literature review has been presented in the third section. The solution approach and the obtained results have been defined in the fourth section giving a detailed description about NLP concepts and working procedure.

Highlights

Phishing is basically a networked theft in which the main motive of phishers is to steal any person’s private information, its financial details like account number, credit card details, login information, payment mode info and many more
This section gives a description on the history and statistics, life cycle, motivation for phishers, email phishing and its working
Given below the various evaluation metrics: True positive rate (TPR): It states the ratio of phishing mails detected with respect to all malicious and genuine mails

Summary

INTRODUCTION

Phishing is basically a networked theft in which the main motive of phishers is to steal any person’s private information, its financial details like account number, credit card details, login information, payment mode info and many more. Many fake sites are available and are used by phishers to fraud people by sending fake mails and steal their private info or make them a victim of email phishing by sending any kind of malicious link or pop-up in mails that the user will unknowingly open and got stuck in their trap. It is a form of fraud in which the attacker represents himself to be genuine entity and attack via communication channels. The malicious content to target an upper level person like the CEO or the person's role in the company is created

BACKGROUND

EVALUATION METRICES

TAXONOMY OF PHISHING ATTACKS

Spoofed websites

LITERATURE REVIEW

Preprocessing of Data

Generation of Datasets for Testing and Training the Model

Results

CONCLUSION

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Computer Science and Information Technologies	Publication Date: May 1, 2020
Citations: 17	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Email phishing: text classification using natural language processing

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Computer Science and Information Technologies

Lead the way for us

Similar Papers

Email phishing: Text classification using natural language processing
Priyanka Verma ... Yogita Gigras
Computer Science and Information Technologies | VOL. 1
Priyanka Verma, et. al.Priyanka Verma ... Yogita Gigras
01 May 2020
Computer Science and Information Technologies | VOL. 1

AN ENHANCEMENT ON TARGETED PHISHING ATTACKS IN THE STATE OF QATAR

-

11 Oct 2019
11 Oct 2019

Combining Natural Language Processing of Electronic Medical Notes With Administrative Data to Determine Racial/Ethnic Differences in the Disclosure and Documentation of Military Sexual Trauma in Veterans.
Adi V Gundlapalli ... Andrew Redd
Medical care | VOL. Suppl 57 6 2
Adi V Gundlapalli, et. al.Adi V Gundlapalli ... Andrew Redd
01 Jun 2019
Medical care | VOL. Suppl 57 6 2

A Comparative Analysis of Anti-Phishing Mechanisms: Email Phishing
...
international journal of advanced research in computer science | VOL. 8
, et. al. ...
30 Apr 2017
international journal of advanced research in computer science | VOL. 8

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Email phishing: text classification using natural language processing

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Computer Science and Information Technologies