Detection of Hate Speech Text in Afan Oromo Social Media using Machine Learning Approach

Naol Bakala Defersha,Kula Kekeba Tune

doi:10.17485/ijst/v14i31.1019

Abstract

Objectives: This study aims to develop a hate speech detection model for Afan Oromo’s texts on social networks like Facebook and Twitter using a machine learning algorithm. Methods: we collected comments and posts from social media like Facebook and Twitter pages of BBC Afan Oromo, OBN Afan Oromo, Fana Afan Oromo Program, Politicians, Activists, Religious Men, and Oromia Communication Bureau using Face pager tool. The collected data was labelled using Afan Oromo hate speech evaluation system we developed. Text preprocessing tasks applied on data to remove special characters, stop-words,HTML Tags, extra whitespaces, numbers, lemmatization. The n-gram and TFIDF was applied for feature extraction task to obtain benchmark Afan Oromo hate speech detection dataset. Researchers split dataset into train and test set. Finally, we applied Support Vector Classifier, Multinomial NB, Linear Support Vector Classifier, Logistic Regression decision tree and Random Forest Classifier on 67% of trained data. The performance of proposed model also evaluated using F-score. We also test the performance of developed model by loading test set into it. Findings: Hate speech on social media violates the welfare of Ethnic groups and citizens for living together. Many researches have been doing for English, Amharic, and other Languages to detect hate content from social media. This study has focused on developing a prototype for Afan Oromo hate speech detection model using machine learning algorithms and evaluate its performance in which we found Linear Support Vector Classifier scored highest f1-score value is 64%. Novelty: Afan Oromo hate speech detection framework proposed and successfully implemented to develop Afan Oromo hate speech detection model. We wrote python script that overcome problems typos in Afan Oromo in addition to designing python scripts that recognized apostrophe “ ’ ” as important letter for Afan Oromo word formation. Yet, no researchers have used combination of n-gram and TF-IDF for feature extraction. In this study, the n-gram and TF-IDF used for feature extraction approach to build model that detect Afan Oromo hate speech on Social media. Keywords: Afan Oromo; Decision tree; Facebook; Hate Speech; Linear Support Vector Classifier; Machine Learning; MultinomialNB; Social Media; Support Vector Classifier; Decision Tree and Random Forest Classifier

Highlights

Social media allows users to create, remove and share their ideas freely using the Internet connection
Afan Oromo hate speech detection data collected from Facebook and Twitter social media platforms using Face pager
We have outlined that developing hate speech detection for Afan Oromo social media is essential to eradicate the risk of hate speech on social welfare

Summary

Introduction

Social media allows users to create, remove and share their ideas freely using the Internet connection. The maximum number of characters per tweet has recently been increased from 200 to 280, encouraging greater flexibility in interaction. Social media allows users to freely communicate and express their ideas using natural language. The challenges of using natural language over social media is generation of hate speech that violate rights of individuals by disseminating hate speech on various perspectives when users freely express opinion (2,3). Hate speech is an expression that violates the right of people of different perspectives, insulting people those are part of religion, activists, by posting opinions, expressions, emotion and feelings over social media platforms

Objectives

Results

Discussion

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Indian Journal of Science and Technology	Publication Date: Aug 17, 2021
Citations: 8	License type: cc-by

R Discovery Prime

R Discovery Prime

Detection of Hate Speech Text in Afan Oromo Social Media using Machine Learning Approach

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Indian Journal of Science and Technology

Lead the way for us

Similar Papers

Tuning Hyperparameters of Machine Learning Methods for Afan Oromo Hate Speech Text Detection for Social Media
Naol Bakala Defersha ... Karthikeyan Kaliyaperumal
-
Naol Bakala Defersha, et. al.Naol Bakala Defersha ... Karthikeyan Kaliyaperumal
16 Dec 2021
16 Dec 2021

An Approach of Hate Speech Identification on Twitter Corpus
Kavita Kumari ... Anupam Jamatia
-
Kavita Kumari, et. al.Kavita Kumari ... Anupam Jamatia
01 Jan 2023
01 Jan 2023

Detecting Text-Bullying on Twitter Using Machine Learning Algorithms
Abdullah Yahya Abdullah Amer ... Tamanna Siddiqui
-
Abdullah Yahya Abdullah Amer, et. al.Abdullah Yahya Abdullah Amer ... Tamanna Siddiqui
01 Jan 2020
01 Jan 2020

Hybrid ANOVA and LASSO Methods for Feature Selection and Linear Support Vector, Multilayer Perceptron and Random Forest Classifiers Based on Spark Environment for Microarray Data Classification
Wafaa S Albaldawi ... Rafah M Almuttairi
IOP Conference Series: Materials Science and Engineering | VOL. 1094
Wafaa S Albaldawi, et. al.Wafaa S Albaldawi ... Rafah M Almuttairi
01 Feb 2021
IOP Conference Series: Materials Science and Engineering | VOL. 1094

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Detection of Hate Speech Text in Afan Oromo Social Media using Machine Learning Approach

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Indian Journal of Science and Technology