Sentiment Analysis of Algerian Dialect Using Machine Learning and Deep Learning with Word2vec

Ahmed Cherif Mazari,Abdelhamid Djeffal

doi:10.31449/inf.v46i6.3340

Ahmed Cherif Mazari, Abdelhamid Djeffal

Open Access

https://doi.org/10.31449/inf.v46i6.3340

Copy DOI

Abstract

In this paper, we deal with the issue of sentiment analysis on dialectal comments extracted from social media. These comments concern the Algerian spoken language, written in Arabic and/or Latin characters, which could be either Modern Standard Arabic, French or local dialect. This complexity gives rise to a large number of text processing issues. The contributions of this work are fourfold. First, we build an Algerian dialect sentiment dataset of 11760 comments collecting from diverse social media platforms. Second, we also create Skip-Gram and CBOW model by word2vec from a corpus containing 466424 comments, these latter are used for enhancing the sentiment dataset by semantically similar words. Third, we propose an adapted preprocessing step set to deal with dialectal texts. Finally, we implement and conduct different machine learning classifiers (SVM, Naive Bayes via its three variants (Bernoulli NB, Gaussian NB and Multinomial NB)) and two deep learning architectures (CNN, RNN) to evaluate and compare the dataset in original version, in a transcribed to Latin character version and then in a semantically-enhanced version by word2vec models . Experiments reach performances of sentiment classifiers applied on "dataset transcribed to Latin characters" of accuracies = (MNB:84.21%, CNN:64.11%) and on "transcribed dataset and enhanced by word2vec models" of accuracies = (SVM:83.70%, RNN:65.21%).

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Informatica	Publication Date: Jul 29, 2022
Citations: 11	License type: cc-by

R Discovery Prime

R Discovery Prime

Sentiment Analysis of Algerian Dialect Using Machine Learning and Deep Learning with Word2vec

Abstract

Talk to us

Similar Papers

More From: Informatica

Lead the way for us

Similar Papers

Sentiment analysis of Indonesian hotel reviews: from classical machine learning to deep learning
Retno Kusumaningrum ... Adi Wibowo
International Journal of Advances in Intelligent Informatics | VOL. 7
Retno Kusumaningrum, et. al.Retno Kusumaningrum ... Adi Wibowo
30 Nov 2021
International Journal of Advances in Intelligent Informatics | VOL. 7

Sentiment Analysis for Arabic Language Using Word Embedding
Osama Elsamadony ... Arabi Keshk
-
Osama Elsamadony, et. al.Osama Elsamadony ... Arabi Keshk
29 Dec 2021
29 Dec 2021

A deep learning analysis on question classification task using Word2vec representations
Seyhmus Yilmaz ... Sinan Toklu
Neural Computing and Applications | VOL. 32
Seyhmus Yilmaz, et. al.Seyhmus Yilmaz ... Sinan Toklu
21 Jan 2020
Neural Computing and Applications | VOL. 32

Sentiment analysis with machine learning and deep learning: A survey of techniques and applications
Nikhil Sanjay Suryawanshi
International Journal of Science and Research Archive | VOL. 12
Nikhil Sanjay Suryawanshi Nikhil Sanjay Suryawanshi
30 Jul 2024
International Journal of Science and Research Archive | VOL. 12

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Sentiment Analysis of Algerian Dialect Using Machine Learning and Deep Learning with Word2vec

Abstract

Talk to us

Similar Papers

More From: Informatica