XRBi-GAC: A hybrid deep learning framework for multilingual toxicity detection

Nitin Kumar Singh,Satish Chand,Pardeep Singh,Prativa Das

doi:10.3233/jifs-224536

Abstract

Social media platforms allow people across the globe to share their thoughts and opinions and conveniently communicate with each other. Apart from various advantages of social media, it is also misused by a set of users for hate-mongering with toxic and offensive comments. The majority of the earlier proposed toxicity detection methods are primarily focused on the English language, but there is a lack of research on low-resource languages and multilingual text data. We propose an XRBi-GAC framework comprising XLM-RoBERTa, Bi-GRU with self-attention and capsule networks for multilingual toxic text detection. A loss function is also presented, which fuses the binary cross-entropy loss and focal loss to address the class imbalance problem. We evaluated the proposed framework on two datasets, namely, the Jigsaw Multilingual Toxic Comment dataset and HASOC 2019 dataset and achieved F1-score of 0.865 and 0.829, respectively. The results of the experiments show that the proposed framework has outperformed the state-of-the-art multilingual models XLM-RoBERTa and mBERT on both datasets, which shows the versatility and robustness of the proposed XRBi-GAC framework.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

XRBi-GAC: A hybrid deep learning framework for multilingual toxicity detection

Abstract

Talk to us

Similar Papers

More From: Journal of Intelligent & Fuzzy Systems

Lead the way for us

Similar Papers

Navigating Social Media in #Ophthalmology
Edmund Tsui ... Rajesh C Rao
Ophthalmology | VOL. 126
Edmund Tsui, et. al.Edmund Tsui ... Rajesh C Rao
20 May 2019
Ophthalmology | VOL. 126

Roman Urdu toxic comment classification
Hafiz Hassaan Saeed ... Asim Karim
Language Resources and Evaluation | VOL. 55
Hafiz Hassaan Saeed, et. al.Hafiz Hassaan Saeed ... Asim Karim
29 Jan 2021
Language Resources and Evaluation | VOL. 55

Going Viral: The 3 Rs of Social Media Messaging during Public Health Emergencies.
Bhavini Patel Murthy ... Tanya Telfair Leblanc
Health security | VOL. 19
Bhavini Patel Murthy, et. al.Bhavini Patel Murthy ... Tanya Telfair Leblanc
01 Feb 2021
Health security | VOL. 19

Multilingual Short Text Classification via Convolutional Neural Network
Jiao Liu ... Yahui Zhao
-
Jiao Liu, et. al.Jiao Liu ... Yahui Zhao
01 Jan 2018
01 Jan 2018

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

XRBi-GAC: A hybrid deep learning framework for multilingual toxicity detection

Abstract

Talk to us

Similar Papers

More From: Journal of Intelligent &amp; Fuzzy Systems

More From: Journal of Intelligent & Fuzzy Systems