Reading Between the Lines: Machine Learning Ensemble and Deep Learning for Implied Threat Detection in Textual Data

Muhammad Owais Raza,Areej Fatemah Meghji,Naeem Ahmed Mahoto,Mana Saleh Al Reshan,Hamad Ali Abosaq,Adel Sulaiman,Asadullah Shaikh

doi:10.1007/s44196-024-00580-y

Abstract

With the increase in the generation and spread of textual content on social media, natural language processing (NLP) has become an important area of research for detecting underlying threats, racial abuse, violence, and implied warnings in the content. The subtlety and ambiguity of language make the development of effective models for detecting threats in text a challenging task. This task is further complicated when the threat is not explicitly conveyed. This study focuses on the task of implied threat detection using an explicitly designed machine-generated dataset with both linguistic and lexical features. We evaluated the performance of different machine learning algorithms on these features including Support Vector Machines, Logistic Regression, Naive Bayes, Decision Tree, and K-nearest neighbors. The ensembling approaches of Adaboost, Random Forest, and Gradient Boosting were also explored. Deep learning modeling was performed using Long Short-Term Memory, Deep Neural Networks (DNN), and Bidirectional Long Short-Term Memory (BiLSTM). Based on the evaluation, it was observed that classical and ensemble models overfit while working with linguistic features. The performance of these models improved when working with lexical features. The model based on logistic regression exhibited superior performance with an F1 score of 77.13%. While experimenting with deep learning models, DNN achieved an F1 score of 91.49% while the BiLSTM achieved an F1 score of 91.61% while working with lexical features. The current study provides a baseline for future research in the domain of implied threat detection.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Reading Between the Lines: Machine Learning Ensemble and Deep Learning for Implied Threat Detection in Textual Data

Abstract

Talk to us

Similar Papers

More From: International Journal of Computational Intelligence Systems

Lead the way for us

Journal: International Journal of Computational Intelligence Systems	Publication Date: Jul 15, 2024
License type: CC BY 4.0

Similar Papers

Machine Learning Analysis of RNA-seq Data for Diagnostic and Prognostic Prediction of Colon Cancer
Erkan Bostanci ... Koray Acici
Sensors | VOL. 23
Erkan Bostanci, et. al.Erkan Bostanci ... Koray Acici
13 Mar 2023
Sensors | VOL. 23

Machine learning and deep learning-based approach to categorize Bengali comments on social networks using fused dataset.
Khandaker Mohammad Mohi Uddin ... Md Ashraf Uddin
PloS one | VOL. 19
Khandaker Mohammad Mohi Uddin, et. al.Khandaker Mohammad Mohi Uddin ... Md Ashraf Uddin
01 Jan 2024
PloS one | VOL. 19

Predicting rainfall using machine learning, deep learning, and time series models across an altitudinal gradient in the North-Western Himalayas
Owais Ali Wani ... Mohamed A Mattar
Scientific Reports | VOL. 14
Owais Ali Wani, et. al.Owais Ali Wani ... Mohamed A Mattar
13 Nov 2024
Scientific Reports | VOL. 14

Comparative Performance of Machine Learning Algorithms in the Prediction of Indoor Daylight Illuminances
Jack Ngarambe ... Geun Young Yun
Sustainability | VOL. 12
Jack Ngarambe, et. al.Jack Ngarambe ... Geun Young Yun
01 Jun 2020
Sustainability | VOL. 12

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Reading Between the Lines: Machine Learning Ensemble and Deep Learning for Implied Threat Detection in Textual Data

Abstract

Talk to us

Similar Papers

More From: International Journal of Computational Intelligence Systems