Machine learning and deep learning-based approach to categorize Bengali comments on social networks using fused dataset.

Khandaker Mohammad Mohi Uddin, Hasibul Hamim, Mst Nishat Tasnim Mim, Arnisha Akhter, Md Ashraf Uddin

doi:10.1371/journal.pone.0308862

Abstract

With the advancement of the modern web and the rapid adoption of social media platforms such as YouTube, Twitter, and Facebook, life has become much easier when dealing with certain highly personal problems. The far-reaching consequences of online harassment require immediate preventative steps, with detection at an early stage, to safeguard psychological wellness and academic achievement. This paper aims to eliminate online harassment and create a criticism-free online environment. We use a variety of features to evaluate a large number of Bengali comments. For the machine learning (ML) models, we process the cleaned data with natural language processing (NLP) techniques, followed by term frequency-inverse document frequency (TF-IDF) weighting with a count vectorizer. For the deep learning (DL) models, we use tokenization with padding to prepare the input. With mathematical visualization and NLP, online bullying can be detected quickly. The ML models employed in this research are Multi-layer Perceptron (MLP), K-Nearest Neighbors (K-NN), Extreme Gradient Boosting (XGBoost), Adaptive Boosting Classifier (AdaBoost), Logistic Regression Classifier (LR), Random Forest Classifier (RF), Bagging Classifier, Stochastic Gradient Descent (SGD), Voting Classifier, and Stacking. We extended the investigation to several DL architectures: Deep Neural Networks (DNN), Convolutional Neural Networks (CNN), Convolutional-Long Short-Term Memory (C-LSTM), and Bidirectional Long Short-Term Memory (BiLSTM). A large amount of data is required to recognize harassing behavior accurately, so we combined two datasets, producing 94,000 Bengali comments from different points of view.
After evaluating the ML and DL models, we find that a hybrid model (MLP+SGD+LR) outperforms the other models: on the multi-label classification task, its accuracy is 99.34%, precision is 99.34%, recall is 99.33%, and F1 score is 99.34%. The binary classification model achieves 99.41% accuracy.
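
The two feature-preparation steps named in the abstract, TF-IDF weighting for the ML models and tokenization with padding for the DL models, can be illustrated with a minimal pure-Python sketch. Everything in it is an assumption made for illustration and not the authors' pipeline: the three toy comments, whitespace splitting in place of a real Bengali tokenizer, the textbook unsmoothed IDF formula (libraries such as scikit-learn use a smoothed variant), zero-valued post-padding, and the hypothetical helper names `tfidf_vector` and `pad_sequence`.

```python
import math
from collections import Counter

# Hypothetical toy corpus; the paper's 94,000-comment dataset is not reproduced here.
comments = [
    "তুমি খুব ভালো",
    "তুমি খুব খারাপ",
    "আজ আবহাওয়া ভালো",
]

# Assumption: whitespace splitting stands in for a proper Bengali tokenizer.
tokenized = [c.split() for c in comments]

# Count-vectorizer step: a fixed, sorted vocabulary shared by all documents.
vocab = sorted({tok for doc in tokenized for tok in doc})

# Document frequency: number of comments that contain each term.
n_docs = len(tokenized)
doc_freq = {t: sum(1 for doc in tokenized if t in doc) for t in vocab}


def tfidf_vector(tokens):
    """Return one TF-IDF weight per vocabulary term (textbook, unsmoothed form)."""
    counts = Counter(tokens)  # missing terms count as 0
    total = len(tokens)
    return [(counts[t] / total) * math.log(n_docs / doc_freq[t]) for t in vocab]


# Feature matrix for the ML models: one row per comment.
tfidf_matrix = [tfidf_vector(doc) for doc in tokenized]

# Integer encoding for the DL models; index 0 is reserved for padding.
word_index = {t: i + 1 for i, t in enumerate(vocab)}


def pad_sequence(ids, max_len, pad_value=0):
    """Truncate to max_len, then append pad_value until the length is max_len."""
    ids = ids[:max_len]
    return ids + [pad_value] * (max_len - len(ids))


sequences = [[word_index[t] for t in doc] for doc in tokenized]
padded = [pad_sequence(s, max_len=4) for s in sequences]
```

Each row of `tfidf_matrix` holds one weight per vocabulary term and could be passed to a classical classifier, while each row of `padded` has the same fixed length, which is what embedding-based networks typically expect as input.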

Journal: PLOS ONE
Publication Date: Jan 1, 2024
License type: CC BY

