Efficient Email Spam Classification with N-gram Features and Ensemble Learning

Prachi Bhatnagar Prachi Bhatnagar,Dr Sheshang Degadwala Degadwala

doi:10.32628/cseit2410220

Abstract

In this paper, we present an innovative approach to enhancing email spam classification using N-gram features, TF-IDF weighting, SMOTE oversampling, and ensemble learning techniques such as Decision Trees, Random Forests, and Ensemble Extra Trees. Our methodology involves preprocessing the dataset to extract N-gram features, applying TF-IDF weighting to highlight important terms, and addressing class imbalance through SMOTE. We then train and evaluate multiple classification models and find that the Ensemble Extra Trees algorithm outperforms others in terms of accuracy, precision, recall, and F1-score. Our experiments on benchmark datasets confirm the efficacy of our approach, showcasing significant improvements in spam detection accuracy and highlighting the potential of ensemble learning for email spam classification. This research contributes to the advancement of spam filtering technologies, providing a robust and efficient solution for accurately identifying and categorizing spam emails.

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Efficient Email Spam Classification with N-gram Features and Ensemble Learning

Abstract

Talk to us

Similar Papers

More From: International Journal of Scientific Research in Computer Science, Engineering and Information Technology

Lead the way for us

Journal: International Journal of Scientific Research in Computer Science, Engineering and Information Technology	Publication Date: Mar 28, 2024
License type: CC BY 4.0

Similar Papers

A Novel Four-Way Approach Designed With Ensemble Feature Selection for Code Smell Detection
Inderpreet Kaur ... Arvinder Kaur
IEEE access : practical innovations, open solutions | VOL. 9
Inderpreet Kaur, et. al.Inderpreet Kaur ... Arvinder Kaur
01 Jan 2020
IEEE access : practical innovations, open solutions | VOL. 9

A review on rainfall forecasting using ensemble learning techniques
Saranagata Kundu ... Sudipta Mandal
e-Prime - Advances in Electrical Engineering, Electronics and Energy | VOL. 6
Saranagata Kundu, et. al.Saranagata Kundu ... Sudipta Mandal
23 Sep 2023
e-Prime - Advances in Electrical Engineering, Electronics and Energy | VOL. 6

Analyzing Amazon Products Sentiment: A Comparative Study of Machine and Deep Learning, and Transformer-Based Techniques
Hashir Ali ... Sule Yayilgan Yildirim
Electronics | VOL. 13
Hashir Ali, et. al.Hashir Ali ... Sule Yayilgan Yildirim
31 Mar 2024
Electronics | VOL. 13

CHRONIC KIDNEY DISEASE DETECTION USING ENSEMBLE LEARNING TECHNIQUES AND COMPARATIVE STUDY
A Gowtham ... Ch Kesava Manikanta
EPRA International Journal of Research & Development (IJRD) | VOL. -
A Gowtham, et. al. A Gowtham ... Ch Kesava Manikanta
05 Apr 2024
EPRA International Journal of Research & Development (IJRD) | VOL. -

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Efficient Email Spam Classification with N-gram Features and Ensemble Learning

Abstract

Talk to us

Similar Papers

More From: International Journal of Scientific Research in Computer Science, Engineering and Information Technology