Hate Speech Detection in Twitter using Transformer Methods

Raymond T Mutanga,Nalindren Naicker,Oludayo O

doi:10.14569/ijacsa.2020.0110972

Raymond T Mutanga, Nalindren Naicker + Show 1 more

Open Access

https://doi.org/10.14569/ijacsa.2020.0110972

Copy DOI

Abstract

Social media networks such as Twitter are increasingly utilized to propagate hate speech while facilitating mass communication. Recent studies have highlighted a strong correlation between hate speech propagation and hate crimes such as xenophobic attacks. Due to the size of social media and the consequences of hate speech in society, it is essential to develop automated methods for hate speech detection in different social media platforms. Several studies have investigated the application of different machine learning algorithms for hate speech detection. However, the performance of these algorithms is generally hampered by inefficient sequence transduction. The Vanilla recurrent neural networks and recurrent neural networks with attention have been established as state-of-the-art methods for the assignments of sequence modeling and sequence transduction. Unfortunately, these methods suffer from intrinsic problems such as long-term dependency and lack of parallelization. In this study, we investigate a transformer-based method and tested it on a publicly available multiclass hate speech corpus containing 24783 labeled tweets. DistilBERT transformer method was compared against attention-based recurrent neural networks and other transformer baselines for hate speech detection in Twitter documents. The study results show that DistilBERT transformer outperformed the baseline algorithms while allowing parallelization.

Highlights

Social media platforms such as Twitter are publicly accessible digital resources for online communication and collaboration
We propose DistilBERT a streamlined version of Bidirectional encoder representations from text (BERT) that uses only half the number of parameters of BERT [27] but retains the performance of BERT in many text processing tasks [33] while making the inference 60% faster than BERT [34]
Results of the proposed DistilBERT method was compared against results computed by BERT, XLNet, RoBERTa and attention-based long short-term memory (LSTM)

Summary

Introduction

Social media platforms such as Twitter are publicly accessible digital resources for online communication and collaboration. Social media companies such as Twitter and Facebook employ human annotators to manually delete messages deemed to be hateful [3]. Users of these platforms are encouraged to flag and report contents they perceive to be inimical to the public. Machine learning algorithms can be classified into two broad categories, which are classical machine learning and deep learning. Both methods have been exploited and tested for hate speech detection in earlier studies

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: International Journal of Advanced Computer Science and Applications	Publication Date: Jan 1, 2020
Citations: 27	License type: cc-by

R Discovery Prime

R Discovery Prime

Hate Speech Detection in Twitter using Transformer Methods

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: International Journal of Advanced Computer Science and Applications

Lead the way for us

Similar Papers

Multi-Temporal Unmanned Aerial Vehicle Remote Sensing for Vegetable Mapping Using an Attention-Based Recurrent Convolutional Neural Network
Quanlong Feng ... Bowen Niu
Remote Sensing | VOL. 12
Quanlong Feng, et. al.Quanlong Feng ... Bowen Niu
22 May 2020
Remote Sensing | VOL. 12

Putting the Toothpaste Back in the Tube: Against Online Hate Speech.
Brenda K Wiederhold
Cyberpsychology, behavior and social networking | VOL. 26
Brenda K WiederholdBrenda K Wiederhold
13 Jun 2023
Cyberpsychology, behavior and social networking | VOL. 26

A Comparative Study of Deep Learning Methods for Hate Speech and Offensive Language Detection in Textual Data
Yogesh Yadav ... Rohan Kumar Gupta
-
Yogesh Yadav, et. al.Yogesh Yadav ... Rohan Kumar Gupta
19 Dec 2021
19 Dec 2021

Bangla hate speech detection on social media using attention-based recurrent neural network
Amit Kumar Das ... Abdullah Al Asif
Journal of Intelligent Systems | VOL. 30
Amit Kumar Das, et. al.Amit Kumar Das ... Abdullah Al Asif
09 Apr 2021
Journal of Intelligent Systems | VOL. 30

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Hate Speech Detection in Twitter using Transformer Methods

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: International Journal of Advanced Computer Science and Applications