A Novel Text Mining Approach for Mental Health Prediction Using Bi-LSTM and BERT Model.

Kamil Zeberga,Yalew Zelalem Jembre,Muhammad Attique,Farman Ali,Babar Shah,Tae-Sun Chung

doi:10.1155/2022/7893775

Abstract

With the current advancement in the Internet, there has been a growing demand for building intelligent and smart systems that can efficiently address the detection of health-related problems on social media, such as the detection of depression and anxiety. These types of systems, which are mainly dependent on machine learning techniques, must be able to deal with obtaining the semantic and syntactic meaning of texts posted by users on social media. The data generated by users on social media contains unstructured and unpredictable content. Several systems based on machine learning and social media platforms have recently been introduced to identify health-related problems. However, the text representation and deep learning techniques employed provide only limited information and knowledge about the different texts posted by users. This is owing to a lack of long-term dependencies between each word in the entire text and a lack of proper exploitation of recent deep learning schemes. In this paper, we propose a novel framework to efficiently and effectively identify depression and anxiety-related posts while maintaining the contextual and semantic meaning of the words used in the whole corpus when applying bidirectional encoder representations from transformers (BERT). In addition, we propose a knowledge distillation technique, which is a recent technique for transferring knowledge from a large pretrained model (BERT) to a smaller model to boost performance and accuracy. We also devised our own data collection framework from Reddit and Twitter, which are the most common social media sites. Finally, we employed word2vec and BERT with Bi-LSTM to effectively analyze and detect depression and anxiety signs from social media posts. Our system surpasses other state-of-the-art methods and achieves an accuracy of 98% using the knowledge distillation technique.

Highlights

With the current advancement in the Internet, there has been a growing demand for building intelligent and smart systems that can efficiently address the detection of health-related problems on social media, such as the detection of depression and anxiety
bidirectional long short-term memory (Bi-long short-term memory (LSTM)) and distilled bidirectional encoder representations from transformers (BERT) achieved higher classification accuracies of 96% and 98%, respectively. e main reason is that the student model mimics the teacher model that initially was trained on general text corpus such as Wikipedia and BookCorpus. erefore, distilled BERT obtains a competitive or even a superior performance when fine-tuned to our depression- and anxiety-related data domain. e learning of this small model from the bigger pretrained model in our proposed framework is termed knowledge distillation
We developed a strongly constructed framework for the detection of mental health problems using deep learning techniques such as BERT, Bi-LSTM, and a knowledge distillation based on social media content created by users. e proposed framework enhances the accuracy of smart healthcare systems to detect mental-health-related problems mainly depression and anxiety. is research work can be utilized to build a real-time system for early mentalhealth-related problem detection mainly based on user posts on Reddit and Twitter

Summary

Introduction

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Computational Intelligence and Neuroscience	Publication Date: Mar 3, 2022
Citations: 65	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

A Novel Text Mining Approach for Mental Health Prediction Using Bi-LSTM and BERT Model.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Computational Intelligence and Neuroscience

Lead the way for us

Similar Papers

Engineering Document Summarization Using Sentence Representations Generated by Bidirectional Language Model
Yunjian Qiu ... Yan Jin
-
Yunjian Qiu, et. al.Yunjian Qiu ... Yan Jin
17 Aug 2021
17 Aug 2021

Toxic Comment Identification and Classification using BERT and SVM
Ivander Gladwin ... Bryan Valerian
-
Ivander Gladwin, et. al.Ivander Gladwin ... Bryan Valerian
07 Sep 2022
07 Sep 2022

Oversampling effect in pretraining for bidirectional encoder representations from transformers (BERT) to localize medical BERT and enhance biomedical BERT
Shoya Wada ... Yasushi Matsumura
Artificial Intelligence In Medicine | VOL. 153
Shoya Wada, et. al.Shoya Wada ... Yasushi Matsumura
05 May 2024
Artificial Intelligence In Medicine | VOL. 153

Classification Performance Comparison of BERT and IndoBERT on SelfReport of COVID-19 Status on Social Media
Irwan Budiman ... Muhammad Itqan Mazdadi
Journal of Computer Sciences Institute | VOL. 30
Irwan Budiman, et. al.Irwan Budiman ... Muhammad Itqan Mazdadi
20 Mar 2024
Journal of Computer Sciences Institute | VOL. 30

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Novel Text Mining Approach for Mental Health Prediction Using Bi-LSTM and BERT Model.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Computational Intelligence and Neuroscience