A Hybrid Model with New Word Weighting for Fast Filtering Spam Short Texts.

Tian Xia, Jiacun Wang, Xuemin Chen, Feng Qiu

doi:10.3390/s23218975

Abstract

Short message services (SMS), microblogging tools, instant messaging apps, and commercial websites produce numerous short text messages every day. These short text messages are usually guaranteed to reach a mass audience at low cost. Spammers take advantage of short texts by sending bulk malicious or unwanted messages. Short texts are difficult to classify because they are brief, sparse, rapidly produced, and informally written. The effectiveness of the hidden Markov model (HMM) for short text classification was demonstrated in our previous study. However, the HMM has limited capability to handle new words, which are mostly generated by informal writing. In this paper, a hybrid model is proposed to address the informal writing issue by weighting new words, enabling fast short text filtering with high accuracy. The hybrid model consists of an artificial neural network (ANN) and an HMM, which are used for new word weighting and spam filtering, respectively. The weight of a new word is calculated from the weights of its neighboring words, along with the spam and ham (i.e., not spam) probabilities that the ANN predicts for the short text message. Performance is evaluated on benchmark datasets, including the SMS message data maintained by the University of California, Irvine, movie reviews, and customer reviews. The hybrid model operates at a significantly higher speed than deep learning models. The experimental results show that the proposed hybrid model outperforms other prominent machine learning algorithms, achieving a good balance between filtering throughput and accuracy.
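The abstract does not give the weighting formula, so the following is only a minimal sketch of the general idea: an unseen word receives a weight derived from its neighbors' weights and from the message-level spam and ham probabilities. The function name `weight_new_word`, the blending parameter `alpha`, the sign convention (positive means spam-leaning), and the simple averaging rule are all assumptions made for illustration, not the authors' method.

```python
from typing import Dict, List


def weight_new_word(
    tokens: List[str],
    index: int,
    known_weights: Dict[str, float],
    p_spam: float,
    p_ham: float,
    alpha: float = 0.5,
) -> float:
    """Illustrative weight for the out-of-vocabulary word at tokens[index].

    Assumptions (not taken from the paper):
      * known_weights maps vocabulary words to scores in [-1, 1],
        where positive values lean spam and negative values lean ham.
      * p_spam and p_ham are message-level probabilities, e.g. from an ANN.
      * alpha blends the neighbor average with the message-level score.
    """
    # Collect weights of the immediate left and right neighbors, if known.
    neighbor_weights = [
        known_weights[tokens[j]]
        for j in (index - 1, index + 1)
        if 0 <= j < len(tokens) and tokens[j] in known_weights
    ]

    # Message-level evidence: positive if the classifier leans spam.
    message_score = p_spam - p_ham

    # With no known neighbors, fall back to the message-level score alone.
    if not neighbor_weights:
        return message_score

    neighbor_mean = sum(neighbor_weights) / len(neighbor_weights)
    return alpha * neighbor_mean + (1.0 - alpha) * message_score


if __name__ == "__main__":
    # Hypothetical vocabulary weights and ANN output for one message.
    vocab = {"free": 0.75, "now": 0.25}
    message = ["free", "ringtonez", "now"]
    print(weight_new_word(message, 1, vocab, p_spam=0.75, p_ham=0.25))
```

In a pipeline of the kind the abstract describes, a weight obtained this way would let a downstream sequence model such as an HMM treat the unseen word as an ordinary observation instead of discarding it; how the paper actually performs that step is not stated here.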

Journal: Sensors (Basel, Switzerland)
Publication Date: Nov 4, 2023
License type: CC BY 4.0

Similar Papers

Transferring topical knowledge from auxiliary long texts for short text clustering
Ou Jin ... Qiang Yang
24 Oct 2011

Feasibility and acceptability of SMS text messaging in a prostate cancer educational intervention for African American men.
Daisy Le ... Annie Coriolan
Health Informatics Journal | VOL. 22
26 Jul 2016

The effectiveness of short mobile phone text message reminders compared to usual care on medication adherence in patients with hypertension: a systematic review protocol
Abebe Muche Belete ... Taklo Simeneh Yazie
Systematic reviews | VOL. 13
05 Feb 2024

Contextual correlation based thread detection in short text message streams
Jiuming Huang ... Quanyuan Wu
Journal of Intelligent Information Systems | VOL. 38
25 May 2011
