(Un/Semi-)supervised SMS text message SPAM detection

Chris R Giannella,Brandon Wilson,Ransom Winder

doi:10.1017/s1351324914000102

Abstract

AbstractWe address the problem of unsupervised and semi-supervised SMS (Short Message Service) text message SPAM detection. We develop a content-based Bayesian classification approach which is a modest extension of the technique discussed by Resnik and Hardisty in 2010. The approach assumes that the bodies of the SMS messages arise from a probabilistic generative model and estimates the model parameters by Gibbs sampling using an unlabeled, or partially labeled, SMS training message corpus. The approach classifies new SMS messages as SPAM or HAM (non-SPAM) by zero-thresholding their logit estimates. We tested the approach on a publicly available SMS corpora collected from the UK. Used in semi-supervised fashion, the approach clearly outperformed a competing algorithm, Semi-Boost. Used in unsupervised fashion, the approach outperformed a fully supervised classifier, an SVM (Support Vector Machine), when the number of training messages used by the SVM was small and performed comparably otherwise. We believe the approach works well and is a useful tool for SMS SPAM detection.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

(Un/Semi-)supervised SMS text message SPAM detection

Abstract

Talk to us

Similar Papers

More From: Natural Language Engineering

Lead the way for us

Journal: Natural Language Engineering	Publication Date: Oct 15, 2014
Citations: 7

Similar Papers

Feasibility and acceptability of short message service (SMS) text messaging to support adherence in patients receiving quetiapine: A pilot study
D Volcke ... S Van Hoorde
European Psychiatry | VOL. 22
D Volcke, et. al.D Volcke ... S Van Hoorde
15 Feb 2007
European Psychiatry | VOL. 22

Development of a Theoretically Driven mHealth Text Messaging Application for Sustaining Recent Weight Loss.
Ryan J Shaw ... Susan G Silva
JMIR mHealth and uHealth | VOL. 1
Ryan J Shaw, et. al.Ryan J Shaw ... Susan G Silva
07 May 2013
JMIR mHealth and uHealth | VOL. 1

The Use of Mobile Apps and SMS Messaging as Physical and Mental Health Interventions: Systematic Review.
Amy Leigh Rathbone ... Julie Prescott
Journal of Medical Internet Research | VOL. 19
Amy Leigh Rathbone, et. al.Amy Leigh Rathbone ... Julie Prescott
24 Aug 2017
Journal of Medical Internet Research | VOL. 19

Resident Use of Text Messaging for Patient Care: Ease of Use or Breach of Privacy?
Micah T Prochaska ... Vineet M Arora
JMIR medical informatics | VOL. 3
Micah T Prochaska, et. al.Micah T Prochaska ... Vineet M Arora
26 Nov 2015
JMIR medical informatics | VOL. 3

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

(Un/Semi-)supervised SMS text message SPAM detection

Abstract

Talk to us

Similar Papers

More From: Natural Language Engineering