Abstract

Abstract: As the popularity of mobile phone devices has increased, Short Message Service (SMS) has grown into a multi-billion dollars industry. At the same time, reduction in the cost of messaging services has resulted in growth in unsolicited commercial advertisements (spams) being sent to mobile phones. In parts of Asia, up to 30% of text messages were spam in 2012. Lack of real databases for SMS spams, short length of messages and limited features, and their informal language are the factors that may cause the established email filtering algorithms to underperform in their classification. In this project, a dataset of real SMS Spams from UCI Machine Learning repository is used, and after pre-processing and vectorization, different machine learning algorithms are applied to the dataset. Finally, the results are compared and the best algorithm for spam filtering for text messaging is introduced and converted into an open-source website. The SMS spam collection set is used for testing the method. After collecting the various supervised learning algorithms, we find that the Multinomial Naïve Bayes algorithm gives us 97.1% Accuracy and 100% Precision

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call