Roman Urdu Hate Speech Detection Using Transformer-Based Model for Cyber Security Applications.

Muhammad Bilal,Salman Jan,Shaukat Ali,Shahrulniza Musa,Atif Khan

doi:10.3390/s23083909

Muhammad Bilal, Salman Jan + Show 3 more

Open Access

https://doi.org/10.3390/s23083909

Copy DOI

Abstract

Social media applications, such as Twitter and Facebook, allow users to communicate and share their thoughts, status updates, opinions, photographs, and videos around the globe. Unfortunately, some people utilize these platforms to disseminate hate speech and abusive language. The growth of hate speech may result in hate crimes, cyber violence, and substantial harm to cyberspace, physical security, and social safety. As a result, hate speech detection is a critical issue for both cyberspace and physical society, necessitating the development of a robust application capable of detecting and combating it in real-time. Hate speech detection is a context-dependent problem that requires context-aware mechanisms for resolution. In this study, we employed a transformer-based model for Roman Urdu hate speech classification due to its ability to capture the text context. In addition, we developed the first Roman Urdu pre-trained BERT model, which we named BERT-RU. For this purpose, we exploited the capabilities of BERT by training it from scratch on the largest Roman Urdu dataset consisting of 173,714 text messages. Traditional and deep learning models were used as baseline models, including LSTM, BiLSTM, BiLSTM + Attention Layer, and CNN. We also investigated the concept of transfer learning by using pre-trained BERT embeddings in conjunction with deep learning models. The performance of each model was evaluated in terms of accuracy, precision, recall, and F-measure. The generalization of each model was evaluated on a cross-domain dataset. The experimental results revealed that the transformer-based model, when directly applied to the classification task of the Roman Urdu hate speech, outperformed traditional machine learning, deep learning models, and pre-trained transformer-based models in terms of accuracy, precision, recall, and F-measure, with scores of 96.70%, 97.25%, 96.74%, and 97.89%, respectively. In addition, the transformer-based model exhibited superior generalization on a cross-domain dataset.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Sensors	Publication Date: Apr 12, 2023
Citations: 9	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Roman Urdu Hate Speech Detection Using Transformer-Based Model for Cyber Security Applications.

Abstract

Talk to us

Similar Papers

More From: Sensors

Lead the way for us

Similar Papers

Context-Aware Deep Learning Model for Detection of Roman Urdu Hate Speech on Social Media Platform
Muhammad Bilal ... Atif Khan
IEEE Access | VOL. 10
Muhammad Bilal, et. al.Muhammad Bilal ... Atif Khan
01 Jan 2021
IEEE Access | VOL. 10

Hierarchical Sentiment Analysis Framework for Hate Speech Detection: Implementing Binary and Multiclass Classification Strategy
Faria Naznin ... Shahran Rahman Alve
Cognizance Journal of Multidisciplinary Studies | VOL. 4
Faria Naznin, et. al.Faria Naznin ... Shahran Rahman Alve
30 Aug 2024
Cognizance Journal of Multidisciplinary Studies | VOL. 4

Hate Speech Detection in Roman Urdu
Muhammad Moin Khan ... Khurram Shahzad
ACM Transactions on Asian and Low-Resource Language Information Processing | VOL. 20
Muhammad Moin Khan, et. al.Muhammad Moin Khan ... Khurram Shahzad
31 Jan 2021
ACM Transactions on Asian and Low-Resource Language Information Processing | VOL. 20

A survey on hate speech detection and sentiment analysis using machine learning and deep learning models
Malliga Subramanian ... G Manikandan
Alexandria Engineering Journal | VOL. 80
Malliga Subramanian, et. al.Malliga Subramanian ... G Manikandan
24 Aug 2023
Alexandria Engineering Journal | VOL. 80

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Roman Urdu Hate Speech Detection Using Transformer-Based Model for Cyber Security Applications.

Abstract

Talk to us

Similar Papers

More From: Sensors