Mobile SMS Spam Filtering for Nepali Text Using Na&amp;#239;ve Bayesian and Support Vector Machine

Tej Bahadur Shahi,Abhimanu Yadav

doi:10.4236/ijis.2014.41004

Abstract

Spam is a universal problem with which everyone is familiar. A number of approaches are used for Spam filtering. The most common filtering technique is content-based filtering which uses the actual text of message to determine whether it is Spam or not. The content is very dynamic and it is very challenging to represent all information in a mathematical model of classification. For instance, in content-based Spam filtering, the characteristics used by the filter to identify Spam message are constantly changing over time. Na?ve Bayes method represents the changing nature of message using probability theory and support vector machine (SVM) represents those using different features. These two methods of classification are efficient in different domains and the case of Nepali SMS or Text classification has not yet been in consideration; these two methods do not consider the issue and it is interesting to find out the performance of both the methods in the problem of Nepali Text classification. In this paper, the Na?ve Bayes and SVM-based classification techniques are implemented to classify the Nepali SMS as Spam and non-Spam. An empirical analysis for various text cases has been done to evaluate accuracy measure of the classification methodologies used in this study. And, it is found to be 87.15% accurate in SVM and 92.74% accurate in the case of Na?ve Bayes.

Highlights

Spam can be defined as unsolicited email for a recipient or any email that the users do not wanted to have in their inboxes
Naïve Bayes and Support Vector Machine algorithms have been implemented for the Spam filtering task
The study has gone through the empirical analysis of the performance of both the Spam filters (SVM and Naïve Bayes) for Nepali SMS

Summary

Introduction

Spam can be defined as unsolicited (unwanted, junk) email for a recipient or any email that the users do not wanted to have in their inboxes. Spam filtering is a special problem in the field of document classification and machine learning. The technological development in mobile devices has increased in computational power, and other powerful systems have been capable to be connected to mobile phone networks. This has increased the communication through SMS. A mail consists of certain structured information such as subject, mail header, salutation, sender’s address etc. These make the SMS classification task much difficult. This situation makes the necessity for developing an efficient SMS filtering method.

Related Work

Methodology: A Proposed Framework for Spam SMS Filtering

Preprocessing

TF-IDF Calculation and Feature Vector Construction

Classification

Experimental Setup and Results

Findings

Conclusions and Future Work

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: International Journal of Intelligence Science	Publication Date: Dec 17, 2013
Citations: 27	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Mobile SMS Spam Filtering for Nepali Text Using Na&#239;ve Bayesian and Support Vector Machine

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: International Journal of Intelligence Science

Lead the way for us

Similar Papers

Mobile SMS Spam Filtering for Nepali Text Using Naïve Bayesian and Support Vector Machine
Tej Bahadur Shahi ... Abhimanu Yadav
International Journal of Intelligence Science | VOL. 04
Tej Bahadur Shahi, et. al.Tej Bahadur Shahi ... Abhimanu Yadav
01 Jan 2014
International Journal of Intelligence Science | VOL. 04

An evaluation of text classification methods for literary study
B Yu
Literary and Linguistic Computing | VOL. 23
B YuB Yu
05 Sep 2008
Literary and Linguistic Computing | VOL. 23

Using Case-Based Reasoning for Spam Filtering

-

28 Jul 2008
28 Jul 2008

Text Classification Using FP-Growth Association Rule and Updating the Term Weight
Santosh K Vishwakarma ... Akhilesh Kumar Sharma
-
Santosh K Vishwakarma, et. al.Santosh K Vishwakarma ... Akhilesh Kumar Sharma
01 Jan 2020
01 Jan 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Mobile SMS Spam Filtering for Nepali Text Using Na&amp;#239;ve Bayesian and Support Vector Machine

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: International Journal of Intelligence Science

Mobile SMS Spam Filtering for Nepali Text Using Naïve Bayesian and Support Vector Machine