Abstract

Twitter Spam has turned out to be a significant predicament of these days. Current works concern on exploiting the machine learning models to detect the spams in Twitter by determining the statistic features of the tweets. Even though these models result in better success, it is hard to sustain the performances attained by the supervised approaches. This paper intends to introduce a deep learning-assisted spam classification model on twitter. This classification is based on sentiments and topics modeled in it. The initial step is data collection. Subsequently, the collected data are preprocessed with “stop word removal, stemming and tokenization”. The next step is feature extraction, wherein, the post tagging, headwords, rule-based lexicon, word length, and weighted holoentropy features are extracted. Then, the proposed sentiment score extraction is carried out to analyze their variations in nonspam and spam information. At last, the diffusions of spam data on Twitter are classified into spam and nonspams. For this, an Optimized Deep Ensemble technique is introduced that encloses “neural network (NN), support vector machine (SVM), random forest (RF) and convolutional neural network (DNN)”. Particularly, the weights of DNN are optimally tuned by an arithmetic crossover-based cat swarm optimization (AC-CS) model. At last, the supremacy of the developed approach is examined via evaluation over extant techniques. Accordingly, the proposed AC-CS [Formula: see text] ensemble model attained better accuracy value when the learning percentage is 80, which is 18.1%, 14.89%, 11.7%, 12.77%, 10.64%, 6.38%, 6.38%, and 6.38% higher than SVM, DNN, RNN, DBN, MFO [Formula: see text] ensemble model, WOA [Formula: see text] ensemble model, EHO [Formula: see text] ensemble model and CSO [Formula: see text] ensemble model models.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call