Multi-Class Sentiment Analysis of Social Media Data with Machine Learning Algorithms

Galimkair Mutanov,Vladislav Karyukin,Zhanl Mamykova

doi:10.32604/cmc.2021.017827

Galimkair Mutanov, Vladislav Karyukin + Show 1 more

Open Access

https://doi.org/10.32604/cmc.2021.017827

Copy DOI

Abstract

The volume of social media data on the Internet is constantly growing. This has created a substantial research field for data analysts. The diversity of articles, posts, and comments on news websites and social networks astonishes imagination. Nevertheless, most researchers focus on posts on Twitter that have a specific format and length restriction. The majority of them are written in the English language. As relatively few works have paid attention to sentiment analysis in the Russian and Kazakh languages, this article thoroughly analyzes news posts in the Kazakhstan media space. The amassed datasets include texts labeled according to three sentiment classes: positive, negative, and neutral. The datasets are highly imbalanced, with a significant predominance of the positive class. Three resampling techniques (undersampling, oversampling, and synthetic minority oversampling (SMOTE)) are used to resample the datasets to deal with this issue. Subsequently, the texts are vectorized with the TF-IDF metric and classified with seven machine learning (ML) algorithms: naïve Bayes, support vector machine, logistic regression, k-nearest neighbors, decision tree, random forest, and XGBoost. Experimental results reveal that oversampling and SMOTE with logistic regression, decision tree, and random forest achieve the best classification scores. These models are effectively employed in the developed social analytics platform.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Computers, Materials & Continua	Publication Date: Jan 1, 2021
Citations: 15	License type: cc-by

R Discovery Prime

R Discovery Prime

Multi-Class Sentiment Analysis of Social Media Data with Machine Learning Algorithms

Abstract

Talk to us

Similar Papers

More From: Computers, Materials & Continua

Lead the way for us

Similar Papers

Comprehensive DDoS Attack Classification Using Machine Learning Algorithms
Olga Ussatova ... Yenlik Begimbayeva
Computers, Materials & Continua | VOL. 73
Olga Ussatova, et. al.Olga Ussatova ... Yenlik Begimbayeva
01 Jan 2021
Computers, Materials & Continua | VOL. 73

Machine learning for the prediction of problems in steel tube bending process
Volkan Görüş ... Mehmet Çevik
Engineering Applications of Artificial Intelligence | VOL. 133
Volkan Görüş, et. al.Volkan Görüş ... Mehmet Çevik
16 May 2024
Engineering Applications of Artificial Intelligence | VOL. 133

Confirming the statistically significant superiority of tree-based machine learning algorithms over their counterparts for tabular data.
Haohui Lu ... Shahadat Uddin
PLOS ONE | VOL. 19
Haohui Lu, et. al.Haohui Lu ... Shahadat Uddin
18 Apr 2024
PLOS ONE | VOL. 19

Identifying chronic disease patients using predictive algorithms in pharmacy administrative claims: an application in rheumatoid arthritis
Ervant J Maksabedian Hernandez ... Jessica Tiu
Journal of Medical Economics | VOL. 24
Ervant J Maksabedian Hernandez, et. al.Ervant J Maksabedian Hernandez ... Jessica Tiu
01 Jan 2020
Journal of Medical Economics | VOL. 24

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Multi-Class Sentiment Analysis of Social Media Data with Machine Learning Algorithms

Abstract

Talk to us

Similar Papers

More From: Computers, Materials &amp; Continua

More From: Computers, Materials & Continua