Abstract

The Support Vector Machine(SVM)based spam filter was summarized briefly.The mail vector was constructed on TF-IDF model and Bernoulli model.The effect to mail classification of CHI method to descend dimension was tested in detail.Kernel based SVM was introduced into spam filtering.The classification accuracy and training time of SVM based on linear kernel,polynomial kernel and radius basis function kernel were compared and analyzed.It was proposed and analyzed that the imbalance of training samples has great affect on the classification accuracy and the false positive ratio.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call