Multilingual Code-Mixed Sentiment Analysis in Hate Speech

Tulika Ranjan,Sujata Swain,Ajaya Kumar Parida,Anish Singh,Rina Kumari,Anjan Bandyopadhyay

doi:10.12694/scpe.v24i4.2375

Abstract

Sentiment analysis discovers the emotion expressed in a text. It helps in analyzing the product reviews, customer feedback and survey responses. Researchers have developed various algorithms for this purpose, however, they have majorly focused only on the sentiment analysis in English language. Although, few works are available for Hindi and multilingual sentiment analysis, however, these works are not efficient enough to perform sentiment analysis in code-mixed languages. To overcome the limitation of the existing works, this paper presents a multilingual code-mixed language model which identifies the sentiments of the hate speech dataset extracted from Twitter. As the hate speech dataset with sentiment labels are not available, we first collect the data from Twitter. After that we label the data using a transformer-based pretrained sentiment analysis model trained on a large corpus of tweets in multiple languages. We pass our collected data as test data to this model and predict the sentiment labels. Now, we train six different machine learning models to perform our own task i.e sentiment analysis for multilingual code-mixed hate speech dataset. The machine learning models perform well across multiple languages and also code-mixed languages. In future, it can be easily adapted to different classification tasks based on code-mixed languages. The results yield that hate speech invokes negative sentiment whereas non-hate speech reflects either positive or neutral sentiment.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Multilingual Code-Mixed Sentiment Analysis in Hate Speech

Abstract

Talk to us

Similar Papers

More From: Scalable Computing: Practice and Experience

Lead the way for us

Similar Papers

T-HSAB: A Tunisian Hate Speech and Abusive Dataset
Hatem Haddad ... Asma Oueslati
-
Hatem Haddad, et. al.Hatem Haddad ... Asma Oueslati
01 Jan 2019
01 Jan 2019

A Literature Review of Textual Hate Speech Detection Methods and Datasets
Fatimah Alkomah ... Xiaogang Ma
Information | VOL. 13
Fatimah Alkomah, et. al.Fatimah Alkomah ... Xiaogang Ma
26 May 2022
Information | VOL. 13

Sentiment Analysis in Multiple Languages: A Review of Current Approaches and Challenges
C Kumaresan ... P Thangaraju
REST Journal on Data Analytics and Artificial Intelligence | VOL. 2
C Kumaresan, et. al.C Kumaresan ... P Thangaraju
01 Mar 2023
REST Journal on Data Analytics and Artificial Intelligence | VOL. 2

Comparative Study of Pre-Trained BERT Models for Code-Mixed Hindi-English Data
Aryan Patil ... Varad Patwardhan
-
Aryan Patil, et. al.Aryan Patil ... Varad Patwardhan
07 Apr 2023
07 Apr 2023

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Multilingual Code-Mixed Sentiment Analysis in Hate Speech

Abstract

Talk to us

Similar Papers

More From: Scalable Computing: Practice and Experience