Offensive Language Detection Using Soft Voting Ensemble Model

Brillian Fieri,Derwin Suhartono

doi:10.13164/mendel.2023.1.001

Abstract

Offensive language is one of the problems that have become increasingly severe along with the rise of the internet and social media usage. This language can be used to attack a person or specific groups. Automatic moderation, such as the usage of machine learning, can help detect and filter this particular language for someone who needs it. This study focuses on improving the performance of the soft voting classifier to detect offensive language by experimenting with the combinations of the soft voting estimators. The model was applied to a Twitter dataset that was augmented using several augmentation techniques. The features were extracted using Term Frequency-Inverse Document Frequency, sentiment analysis, and GloVe embedding. In this study, there were two types of soft voting models: machine learning-based, with the estimators of Random Forest, Decision Tree, Logistic Regression, Naïve Bayes, and AdaBoost as the best combination, and deep learning-based, with the best estimator combination of Convolutional Neural Network, Bidirectional Long Short-Term Memory, and Bidirectional Gated Recurrent Unit. The results of this study show that the soft voting classifier was better in performance compared to classic machine learning and deep learning models on both original and augmented datasets.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Offensive Language Detection Using Soft Voting Ensemble Model

Abstract

Talk to us

Similar Papers

More From: MENDEL

Lead the way for us

Journal: MENDEL	Publication Date: Jun 30, 2023
License type: CC BY-NC-SA 4.0

Similar Papers

Pengaruh Metode Penyeimbangan Kelas Terhadap Tingkat Akurasi Analisis Sentimen pada Tweets Berbahasa Indonesia
Hapnes Toba ... Ivan Nathaniel Husada
Jurnal Teknik Informatika dan Sistem Informasi | VOL. 6
Hapnes Toba, et. al.Hapnes Toba ... Ivan Nathaniel Husada
11 Aug 2020
Jurnal Teknik Informatika dan Sistem Informasi | VOL. 6

A Deep Learning Approach Combining CNN and Bi-LSTM with SVM Classifier for Arabic Sentiment Analysis
Omar Alharbi
International Journal of Advanced Computer Science and Applications | VOL. 12
Omar AlharbiOmar Alharbi
01 Jan 2020
International Journal of Advanced Computer Science and Applications | VOL. 12

Empirical comparison of deep learning models for fNIRS pain decoding.
Raul Fernandez Rojas ... Keng-Liang Ou
Frontiers in neuroinformatics | VOL. 18
Raul Fernandez Rojas, et. al.Raul Fernandez Rojas ... Keng-Liang Ou
14 Feb 2024
Frontiers in neuroinformatics | VOL. 18

Sentiment Analysis of Arabic Tweets using Deep Learning
Maha Heikal ... Nagwa El-Makky
Procedia Computer Science | VOL. 142
Maha Heikal, et. al.Maha Heikal ... Nagwa El-Makky
01 Jan 2018
Procedia Computer Science | VOL. 142

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Offensive Language Detection Using Soft Voting Ensemble Model

Abstract

Talk to us

Similar Papers

More From: MENDEL