Effect of feature selection methods on machine learning classifiers for detecting email spams

Shrawan Kumar Trivedi,Shubhamoy Dey

doi:10.1145/2513228.2513313

Effect of feature selection methods on machine learning classifiers for detecting email spams

Shrawan Kumar Trivedi, Shubhamoy Dey

https://doi.org/10.1145/2513228.2513313

Copy DOI

Publication Date: Oct 1, 2013

Citations: 23

Affiliation: Indian Institute of Management Indore

#Greedy Stepwise Search #Effect Of Feature Selection Methods + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

This research presents the effects of using features selected by two feature selection methods i.e. Genetic Search and Greedy Stepwise Search on popular Machine Learning Classifiers like Bayesian, Naive Bayes, Support Vector Machine and Genetic Algorithm. Tests were performed on two different publicly available spam email datasets: Enron and SpamAssassin. Results show that, Greedy Stepwise Search is a good method for feature selection for spam email detection. Among the Machine Learning Classifiers, Support Vector Machine has been found to be the best both in terms of accuracy and False Positive rate

Full Text