Authorship Authentication of Short Messages from Social Networks Machines

Nesibe Merve Demir, Mehmet Can

doi:10.21533/scjournal.v7i1.148

Abstract

The dataset consists of 17,000 tweets collected from Twitter, 500 tweets for each of 34 authors who meet certain criteria. The raw data is collected with the software NVivo and then preprocessed to extract the frequencies of 200 features. During data analysis, 128 of these features are eliminated because they are rare in tweets. Results are presented progressively, with five, fifteen, twenty, twenty-five, thirty, and thirty-four of these authors selected in turn. Since recurrent artificial neural networks are more stable, and ANNs in general are more successful at distinguishing two classes, N×N neural networks are trained for pairwise classification of N authors. These experts are then organized into N competing teams (CANNT) to aggregate the decisions of the N×N experts. The procedure is repeated seven times, and committees of seven members vote for the final decision. With this commonest-type voting, accuracy is boosted by around ten percent. The number of authors is seen to have little effect on authentication accuracy, and around 80% accuracy is achieved for any number of authors.
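The committee step described above can be sketched as a simple majority vote. This is a minimal illustration, not the authors' implementation: the function name `committee_vote`, the author labels, and the tie-breaking rule (the label seen first wins) are assumptions introduced here.

```python
from collections import Counter


def committee_vote(member_predictions):
    """Return the most common label among the committee members' predictions.

    Assumption: ties go to the label that appears first in the list,
    which is how Counter.most_common orders equal counts.
    """
    if not member_predictions:
        raise ValueError("at least one prediction is required")
    counts = Counter(member_predictions)
    return counts.most_common(1)[0][0]


# Hypothetical predictions from seven repeated runs for one tweet.
seven_runs = ["author_3", "author_1", "author_3", "author_7",
              "author_3", "author_1", "author_3"]
final_author = committee_vote(seven_runs)
```

With an odd number of members and only two competing labels a tie cannot occur, but with many authors it can, so any real implementation would need an explicit tie-breaking policy.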

Highlights

Normalization was done by dividing each value by the total word count of the corresponding text, in order to remove the influence of differing overall text sizes.
Feature vectors extracted from Twitter messages were used as input for modeling the artificial neural network (ANN).
Since ANNs are more successful at distinguishing two classes, N×N neural networks are trained for pairwise classification of N authors.
These experts are organized as N special teams (CANNT) of N experts each to aggregate their decisions.
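The normalization and team aggregation steps listed above might look roughly like the following sketch. The source does not specify how a team combines its experts' outputs, so the rule used here (average the scores in each team, ignore the diagonal, pick the team with the highest average) is an assumption, as are the names `normalize_counts` and `team_decision`.

```python
def normalize_counts(feature_counts, total_words):
    """Divide each raw feature count by the text's total word count."""
    if total_words <= 0:
        raise ValueError("total_words must be positive")
    return [count / total_words for count in feature_counts]


def team_decision(pair_scores):
    """Pick an author index from an N x N matrix of pairwise expert scores.

    pair_scores[i][j] is assumed to be a score in [0, 1] from the expert
    trained on authors i and j, where higher means "written by author i".
    Assumption: team i averages its row, skipping the diagonal entry,
    and the team with the highest average wins.
    """
    n = len(pair_scores)
    if n < 2:
        raise ValueError("at least two authors are required")
    team_scores = []
    for i in range(n):
        others = [pair_scores[i][j] for j in range(n) if j != i]
        team_scores.append(sum(others) / len(others))
    return max(range(n), key=lambda i: team_scores[i])


# Hypothetical example: three feature counts in a ten-word text.
normalized = normalize_counts([2, 0, 5], 10)

# Hypothetical pairwise scores for three authors.
scores = [
    [0.0, 0.9, 0.8],
    [0.1, 0.0, 0.4],
    [0.2, 0.6, 0.0],
]
predicted_author = team_decision(scores)
```

Averaging is only one possible aggregation rule; counting pairwise wins per team would be an equally plausible reading of "competing teams".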

Summary

INTRODUCTION

Green and Sheppard (2013) focused on messages collected from Twitter to analyze the most effective feature sets for authorship verification. They used the sequential minimal optimization (SMO) algorithm included in Weka to classify 10 authors with 120 tweets each and achieved a 44% accuracy rate. They compared style marker (SM) feature sets with bag-of-words (BOW) feature sets and reported that SM features are more effective than BOW features for authorship verification. The obtained features were used to train a linear SVM classifier to predict the author of an unknown tweet. Their results showed that better results were obtained as the amount of data increased. An AdaBoost classifier achieved the best results, with 84% accuracy for 5 authors.

A BRIEF NOTE ON ANNS

Multi Layer Perceptrons for Forecasting

Network Architecture

Bootstrapping

Findings

CONCLUSION

Journal: Southeast Europe Journal of Soft Computing	Publication Date: May 10, 2018
Citations: 1	License type: cc-by

