Abstract

The header section of an e-mail usually contains important attributes, such as the e-mail title, the sender's name, the sender's e-mail address, and the sending date, which are helpful for classifying e-mails. In this paper, we apply a decision tree data mining technique to these basic header attributes to analyze the association rules of spam e-mails, and we propose an efficient spam filtering method that accurately distinguishes spam from legitimate e-mails. In an experiment applying our spam filtering method to a large number of Chinese e-mails, we obtain the following results: an accuracy of 96.5%, a precision of 96.67%, and a recall of 96.3%. Thus, the proposed method can efficiently identify spam e-mails by checking only the header section, which reduces the computational cost.
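As a rough illustration only, the sketch below shows how the header attributes named above might be pulled out of a raw message with Python's standard `email` package, and how accuracy, precision, and recall are conventionally computed from confusion-matrix counts. The function names `header_features` and `spam_metrics`, the choice of derived features, and the sample message are assumptions made for this example; they are not the feature set, decision tree, or data used in the paper.

```python
from email import message_from_string
from email.utils import parseaddr, parsedate_to_datetime


def header_features(raw_message: str) -> dict:
    """Extract basic header attributes (hypothetical feature set)."""
    msg = message_from_string(raw_message)
    sender_name, sender_addr = parseaddr(msg.get("From", ""))
    subject = msg.get("Subject", "") or ""
    try:
        sent_hour = parsedate_to_datetime(msg.get("Date", "")).hour
    except (TypeError, ValueError):
        # Missing or malformed Date header.
        sent_hour = None
    return {
        "subject": subject,
        "sender_name": sender_name,
        "sender_address": sender_addr,
        "sent_hour": sent_hour,
    }


def spam_metrics(tp: int, fp: int, tn: int, fn: int) -> dict:
    """Standard definitions, with spam treated as the positive class."""
    return {
        "accuracy": (tp + tn) / (tp + fp + tn + fn),
        "precision": tp / (tp + fp),
        "recall": tp / (tp + fn),
    }


# Made-up message, used only to show the shape of the output.
sample = (
    "From: Alice <alice@example.com>\n"
    "Subject: Hello\n"
    "Date: Mon, 01 Jan 2024 10:30:00 +0000\n"
    "\n"
    "body text\n"
)
features = header_features(sample)
```

A dictionary like `features` could then be encoded and passed to any decision tree learner; the counts given to `spam_metrics` would come from comparing the learner's predictions with labelled test e-mails.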
