Naïve Bayes classifiers for authorship attribution of Arabic texts

Alaa Saleh Altheneyan,Mohamed El Bachir Menai

doi:10.1016/j.jksuci.2014.06.006

Alaa Saleh Altheneyan, Mohamed El Bachir Menai

Open Access

https://doi.org/10.1016/j.jksuci.2014.06.006

Copy DOI

Abstract

Authorship attribution is the process of assigning an author to an anonymous text based on writing characteristics. Several authorship attribution methods were developed for natural languages, such as English, Chinese and Dutch. However, the number of related works for Arabic is limited. Naïve Bayes classifiers have been widely used for various natural language processing tasks. However, there is generally no mention of the event model used, which can have a considerable impact on the performance of the classifier. To the best of our knowledge, naïve Bayes classifiers have not yet been considered for authorship attribution in Arabic. Therefore, we propose to study their use for this problem, taking into account different event models, namely, simple naïve Bayes (NB), multinomial naïve Bayes (MNB), multi-variant Bernoulli naïve Bayes (MBNB) and multi-variant Poisson naïve Bayes (MPNB). We evaluate these models’ performances on a large Arabic dataset extracted from books of 10 different authors and compare them with other existing methods. The experimental results show that MBNB provides the best results and could attribute the author of a text with an accuracy of 97.43%. Comparison results with related methods indicate that MBNB and MNB are appropriate for authorship attribution.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Journal of King Saud University - Computer and Information Sciences	Publication Date: Sep 28, 2014
Citations: 44	License type: cc-by-nc-nd

R Discovery Prime

R Discovery Prime

Naïve Bayes classifiers for authorship attribution of Arabic texts

Abstract

Talk to us

Similar Papers

More From: Journal of King Saud University - Computer and Information Sciences

Lead the way for us

Similar Papers

On Authorship Attribution of Telugu Text
S Nagaprasad ... J K R Sastry
Indian Journal of Science and Technology | VOL. 9
S Nagaprasad, et. al.S Nagaprasad ... J K R Sastry
29 Sep 2016
Indian Journal of Science and Technology | VOL. 9

Telugu Text Classification Using Supervised Machine Learning Algorithm
G V Subba Raju ... Srinivasu Badugu
-
G V Subba Raju, et. al.G V Subba Raju ... Srinivasu Badugu
01 Jan 2021
01 Jan 2021

Investigating the Statistical Assumptions of Naïve Bayes Classifiers
Anthony Kelly ... Marc Anthony Johnson
-
Anthony Kelly, et. al.Anthony Kelly ... Marc Anthony Johnson
24 Mar 2021
24 Mar 2021

Optimize network intrusion detection system based on PCA feature extraction and three naïve bayes classifiers
Shaymaa A Kadom ... Soukaena H Hashem
Journal of Physics: Conference Series | VOL. 2322
Shaymaa A Kadom, et. al.Shaymaa A Kadom ... Soukaena H Hashem
01 Aug 2022
Journal of Physics: Conference Series | VOL. 2322

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Naïve Bayes classifiers for authorship attribution of Arabic texts

Abstract

Talk to us

Similar Papers

More From: Journal of King Saud University - Computer and Information Sciences