Multilingual Sentiment Mining System to Prognosticate Governance

Reza Khademi Zahedi,Hooman Khademi Zahedi,Timon Rabczuk,Naif Alajlan

doi:10.32604/cmc.2022.021384

Abstract

In the age of the internet, social media are connecting us all at the tip of our fingers. People are linkedthrough different social media. The social network, Twitter, allows people to tweet their thoughts on any particular event or a specific political body which provides us with a diverse range of political insights. This paper serves the purpose of text processing of a multilingual dataset including Urdu, English, and Roman Urdu. Explore machine learning solutions for sentiment analysis and train models, collect the data on government from Twitter, apply sentiment analysis, and provide a python library that classifies text sentiment. Training data contained tweets in three languages: English: 200k, Urdu: 200k and Roman Urdu: 11k. Five different classification models are applied to determine sentiments, and eventually, the use of ensemble technique to move forward with the acquired results is explored. The Logistic Regression model performed best with an accuracy of 75%, followed by the Linear Support Vector classifier and Stochastic Gradient Descent model, both having 74% accuracy. Lastly, Multinomial Naïve Bayes and Complement Naïve Bayes models both achieved 73% accuracy.

Highlights

Social media is playing a vital role in the estimation of the perception of public opinions.Social media content is widely available and easy to access to predict people’s perspectives on any current issue [1]
We provide a Python library for multilingual sentiment analysis focused on the three languages discussed earlier
The Term Frequency-Inverse Document Frequency (TF-IDF) Vectorizer was used for word embedding and TF-IDF Transformer for term weighting

Summary

Introduction

Social media is playing a vital role in the estimation of the perception of public opinions. Political bodies tend to adopt a specific tone for their target results If they want to start a campaign and want public support, a positive temper is adapted and attached pictures with tweets, whereas their tone is neutral for information sharing. Dictionary-based techniques are compelling at times, which require not many exertions in human-marked reports This survey mainly focused on sentiment analysis of the Twitter dataset, which helps analyse the information and filter them out either as positive, negative, or neutral. The objective of the research paper is to investigate preprocessing techniques and implement various machine learning paradigms Well recognised for their performance on sentiment analysis of textual documents and provide a python package that predicts sentiment and opinion mining of data about governance. The results are presented and explained along with the conclusion

Literature Review

Data and Methodology

Dataset

Governance Concerning Dataset

Data Cleaning and Preprocessing

Feature Extraction and Classification Models

Feature Extraction

Classification Models

Results and Discussions

Comparative Analysis of Models

Ensemble Technique

Multilingual Sentiment Classifier

Conclusion

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Computers, Materials & Continua	Publication Date: Jan 1, 2022
Citations: 1	License type: cc-by

R Discovery Prime

R Discovery Prime

Multilingual Sentiment Mining System to Prognosticate Governance

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Computers, Materials & Continua

Lead the way for us

Similar Papers

Discourse Based Opinion Mining on Roman Urdu Data
Zareen Sharaf ... Husnain Manzoor Ali
Journal of Independent Studies and Research Computing | VOL. 17
Zareen Sharaf, et. al.Zareen Sharaf ... Husnain Manzoor Ali
01 Jan 2019
Journal of Independent Studies and Research Computing | VOL. 17

An Unsupervised Approach for Sentiment Analysis on Social Media Short Text Classification in Roman Urdu
Kiran Shahzadi ... Toqir A Rana
ACM Transactions on Asian and Low-Resource Language Information Processing | VOL. 21
Kiran Shahzadi, et. al.Kiran Shahzadi ... Toqir A Rana
03 Nov 2021
ACM Transactions on Asian and Low-Resource Language Information Processing | VOL. 21

Deep Sentiment Analysis Using CNN-LSTM Architecture of English and Roman Urdu Text Shared in Social Media
Hsien-Tsung Chang ... Lal Khan
Applied Sciences | VOL. 12
Hsien-Tsung Chang, et. al.Hsien-Tsung Chang ... Lal Khan
04 Mar 2022
Applied Sciences | VOL. 12

Sentiment Analysis of User Reviews about Hotel in Roman Urdu
Muhammad Abdul Qayum ... Mudassar Ahmad
-
Muhammad Abdul Qayum, et. al.Muhammad Abdul Qayum ... Mudassar Ahmad
16 Dec 2020
16 Dec 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Multilingual Sentiment Mining System to Prognosticate Governance

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Computers, Materials &amp; Continua

More From: Computers, Materials & Continua