Abstract
Email spam is an increasing problem because it disrupting and time consuming for user, since the easy and cheap of sending email. Email Spam filtering can be done with a binary classification with machine learning as classifier. To date, email spam detection still challenging since the email spam still happens a lot and the detection still need improvement. Decision Tree (DT) is one of famous classifier since DT able to handle nominal and numerical attributes and increasing the efficiency of computing. However, DT has a weakness in over-sensitivity to the training set and the noise data or instance that can degrade the performance. In this study, we propose hybrid combination Logistic Regression (LR) and DT for email spam detection. LR is used for reduce noisy data or instance before data feed to DT induction. Noisy data reducing is done by LR by filtering correct prediction with certain false negative threshold. In this study, Spambase dataset is used to evaluate the proposed method. From the experiment, the result shows that proposed method yield impressive and promising result with the accuracy is 91.67%. It can be concluded that LR able to improve DT performance by reducing noisy data.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.