SMS Spam Filteration Using Text Features and Supervised Machine Learning Algorithms

Rashmi Pandey Rashmi Pandey,Pushpendra Prajapati Pushpendra Prajapati,Vibhanshu Kumar Singh Vibhanshu Kumar Singh,Mayank Tyagi Mayank Tyagi,Chetan Anand Amb Chetan Anand Amb

doi:10.32628/cseit2410452

Abstract

Over time, technological advancements have had an immense effect on every aspect of life, including travel, office work, music, healthcare, and communication. In the past, people communicated using telephone lines. With far more functionality than telephone cable technology, wireless technology already prevails. SMS is mostly used by spammers and advertising firms to communicate with the general public and distribute company pamphlets. This explains why over 60% of spam SMS are sent and received every day. Although these spam communications irritate users and occasionally con unsuspecting users, the spammers and ad businesses benefit handsomely from them. This paper suggested a method for categorizing ham and spam SMS using supervised machine learning approaches. Features are extracted from data using feature extraction techniques like bag-of- words and Term Frequency-Inverse Document Frequency (TF-IDF). The imbalance in the SMS dataset we used was addressed by applying both oversampling and under sampling techniques. The support vector classifier, gradient boosting machine, random forest, Gaussian Naive Bayes, and logistics regression are implemented on the using spam SMS and ham SMS data sets, evaluated by F1 score, accuracy, precision and recall are used to assess performance. According to the experiment's findings, the random forest diagnoses spam and ham SMS more precisely-99% of the time.

Full Text

Published Version (Free)

View/Download pdf

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

SMS Spam Filteration Using Text Features and Supervised Machine Learning Algorithms

Abstract

Published Version (Free)

Talk to us

Similar Papers

More From: International Journal of Scientific Research in Computer Science, Engineering and Information Technology

Lead the way for us

Journal: International Journal of Scientific Research in Computer Science, Engineering and Information Technology	Publication Date: Nov 18, 2024
License type: CC BY 4.0

Similar Papers

Detecting Spam Emails/SMS Using Naive Bayes, Support Vector Machine and Random Forest
Vasudha Goswami ... Pratyush Sharma
-
Vasudha Goswami, et. al.Vasudha Goswami ... Pratyush Sharma
01 Jan 2020
01 Jan 2020

User Interface Bug Classification Model Using ML and NLP Techniques: A Comparative Performance Analysis of ML Models
Sara Khan ... Saurabh Pal
International Journal of Experimental Research and Review | VOL. 45
Sara Khan, et. al.Sara Khan ... Saurabh Pal
30 Nov 2024
International Journal of Experimental Research and Review | VOL. 45

SMS Spam Detection using H2O Framework
Dima Suleiman ... Ghazi Al-Naymat
Procedia Computer Science | VOL. 113
Dima Suleiman, et. al.Dima Suleiman ... Ghazi Al-Naymat
01 Jan 2017
Procedia Computer Science | VOL. 113

Semi-supervised novelty detection with one class SVM for SMS spam detection
Suleiman Y Yerima ... Abul Bashar
-
Suleiman Y Yerima, et. al.Suleiman Y Yerima ... Abul Bashar
01 Jun 2022
01 Jun 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

SMS Spam Filteration Using Text Features and Supervised Machine Learning Algorithms

Abstract

Published Version (Free)

Talk to us

Similar Papers

More From: International Journal of Scientific Research in Computer Science, Engineering and Information Technology