Abstract
Online social networking (SN) data presents a data stream that is rich in context and temporal information. It holds promise for predicting suicidal thoughts and behaviors. The fusion of SN data with machine learning algorithms offers a potential path forward. This research proposes a Max Voting Ensemble classifier model applied to a Reddit dataset for the identification of suicidal ideation. The preprocessing involves data cleansing, tokenization, and lemmatization. Additionally, TF-IDF and Word2Vec word embedding techniques are applied. Diverse machine learning algorithms, including Support Vector Machines (SVM), Logistic Regression (LR), Random Forest (RF), Multinomial Naive Bayes (MNB), AdaBoost, and XGBoost, are implemented. The results of selected Machine Learning Classifiers (MLCs) are amalgamated using a Max Voting Ensemble classifier. The research findings clearly indicate that the Max Voting Ensemble classifier yields improved precision of 91.39% coupled with a substantial accuracy of 87.5%. The application of Ensembling Techniques (ET) to SN data holds the potential to address the complexities and modeling challenges inherent in predicting acute Suicidal Ideation within these dynamic time scales.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.