Autoencoder-Based Feature Extraction for Identifying Hate Speech Spreaders in Social Media

Gunjan Kumar,Amit Kumar Singh,Jyoti Prakash Singh

doi:10.1109/tcss.2023.3240098

Abstract

Hate speech on social media has become a big problem, making regular users very upset and giving victims depression and suicidal thoughts. Early identification of the user spreading this type of hate speech may be a better solution, allowing hate speech to be stopped at source. In this article, we attempt to identify these hate speech spreaders by finding a representation for each user. Each user’s comments are aggregated and fed to an auto-encoder to train it. The encoder part of the auto-encoder is used to get an encoded vector for each user. The encoded vector is used with different machine learning (ML) classifiers to determine if a user is spreading hate speech. The proposed model was tested using the dataset released by PAN 2021 (https://pan.webis.de/data.html) hate speech spreader profiling competition in English and Spanish. The experimental results show that support vector machine (SVM) with encoded vectors as features outperforms existing models with an accuracy of 92% for both English and Spanish dataset. The proposed features extraction technique is found to be equally effective at identifying fake news spreaders on fake news datasets provided by PAN 2020 yielding accuracy values of 95% and 83% for English and Spanish, respectively.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Autoencoder-Based Feature Extraction for Identifying Hate Speech Spreaders in Social Media

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Computational Social Systems

Lead the way for us

Journal: IEEE Transactions on Computational Social Systems	Publication Date: Jan 1, 2024
Citations: 3

Similar Papers

Hate Speech Classification Using SVM and Naive BAYES
...
arXiv (Cornell University) | VOL. -
, et. al. ...
21 Mar 2022
arXiv (Cornell University) | VOL. -

Hate Speech Detection in Indonesian Twitter Texts using Bidirectional Gated Recurrent Unit
Angela Marpaung ... Rita Rismala
-
Angela Marpaung, et. al.Angela Marpaung ... Rita Rismala
21 Jan 2021
21 Jan 2021

An Approach of Hate Speech Identification on Twitter Corpus
Kavita Kumari ... Anupam Jamatia
-
Kavita Kumari, et. al.Kavita Kumari ... Anupam Jamatia
01 Jan 2023
01 Jan 2023

Development of Pidgin English Hate Speech Classification System for Social Media
Folake Adegoke ... Eneh Agozie
American Journal of Information Science and Technology | VOL. 8
Folake Adegoke, et. al.Folake Adegoke ... Eneh Agozie
14 Jun 2024
American Journal of Information Science and Technology | VOL. 8

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Autoencoder-Based Feature Extraction for Identifying Hate Speech Spreaders in Social Media

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Computational Social Systems