New Spam Filtering Method with Hadoop Tuning-Based MapReduce Na飗e Bayes

Keungyeup Ji,Youngmi Kwon

doi:10.32604/csse.2023.031270

Abstract

As the importance of email increases, the amount of malicious email is also increasing, so the need for malicious email filtering is growing. Since it is more economical to combine commodity hardware consisting of a medium server or PC with a virtual environment to use as a single server resource and filter malicious email using machine learning techniques, we used a Hadoop MapReduce framework and Naïve Bayes among machine learning methods for malicious email filtering. Naïve Bayes was selected because it is one of the top machine learning methods(Support Vector Machine (SVM), Naïve Bayes, K-Nearest Neighbor(KNN), and Decision Tree) in terms of execution time and accuracy. Malicious email was filtered with MapReduce programming using the Naïve Bayes technique, which is a supervised machine learning method, in a Hadoop framework with optimized performance and also with the Python program technique with the Naïve Bayes technique applied in a bare metal server environment with the Hadoop environment not applied. According to the results of a comparison of the accuracy and predictive error rates of the two methods, the Hadoop MapReduce Naïve Bayes method improved the accuracy of spam and ham email identification 1.11 times and the prediction error rate 14.13 times compared to the non-Hadoop Python Naïve Bayes method.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Computer Systems Science and Engineering	Publication Date: Jan 1, 2023
Citations: 4	License type: cc-by

R Discovery Prime

R Discovery Prime

New Spam Filtering Method with Hadoop Tuning-Based MapReduce Na飗e Bayes

Abstract

Talk to us

Similar Papers

More From: Computer Systems Science and Engineering

Lead the way for us

Similar Papers

Sensors support machine learning
-
Food Science and Technology | VOL. 33
--
01 Dec 2019
Food Science and Technology | VOL. 33

Classification of malicious emails
Eva Markova ... Tomas Bajtos
-
Eva Markova, et. al.Eva Markova ... Tomas Bajtos
01 Nov 2019
01 Nov 2019

Machine learning technique for morphological classification of galaxies from the SDSS
I B Vavilova ... V Khramtsov
Astronomy & Astrophysics | VOL. 648
I B Vavilova, et. al.I B Vavilova ... V Khramtsov
01 Apr 2021
Astronomy & Astrophysics | VOL. 648

Introducing SSDs to the Hadoop MapReduce Framework
Sangwhan Moon ... Jaehwan Lee
-
Sangwhan Moon, et. al.Sangwhan Moon ... Jaehwan Lee
01 Jun 2014
01 Jun 2014

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

New Spam Filtering Method with Hadoop Tuning-Based MapReduce Na飗e Bayes

Abstract

Talk to us

Similar Papers

More From: Computer Systems Science and Engineering