Depression Risk Prediction for Chinese Microblogs via Deep-Learning Methods: Content Analysis

Xiaofeng Wang,Jie Zheng,Wanting Li,Buzhou Tang,Shuai Chen,Qingcai Chen,Yejie Zhou,Tao Li,Jun Yan

doi:10.2196/17958

Abstract

BackgroundDepression is a serious personal and public mental health problem. Self-reporting is the main method used to diagnose depression and to determine the severity of depression. However, it is not easy to discover patients with depression owing to feelings of shame in disclosing or discussing their mental health conditions with others. Moreover, self-reporting is time-consuming, and usually leads to missing a certain number of cases. Therefore, automatic discovery of patients with depression from other sources such as social media has been attracting increasing attention. Social media, as one of the most important daily communication systems, connects large quantities of people, including individuals with depression, and provides a channel to discover patients with depression. In this study, we investigated deep-learning methods for depression risk prediction using data from Chinese microblogs, which have potential to discover more patients with depression and to trace their mental health conditions.ObjectiveThe aim of this study was to explore the potential of state-of-the-art deep-learning methods on depression risk prediction from Chinese microblogs.MethodsDeep-learning methods with pretrained language representation models, including bidirectional encoder representations from transformers (BERT), robustly optimized BERT pretraining approach (RoBERTa), and generalized autoregressive pretraining for language understanding (XLNET), were investigated for depression risk prediction, and were compared with previous methods on a manually annotated benchmark dataset. Depression risk was assessed at four levels from 0 to 3, where 0, 1, 2, and 3 denote no inclination, and mild, moderate, and severe depression risk, respectively. The dataset was collected from the Chinese microblog Weibo. We also compared different deep-learning methods with pretrained language representation models in two settings: (1) publicly released pretrained language representation models, and (2) language representation models further pretrained on a large-scale unlabeled dataset collected from Weibo. Precision, recall, and F1 scores were used as performance evaluation measures.ResultsAmong the three deep-learning methods, BERT achieved the best performance with a microaveraged F1 score of 0.856. RoBERTa achieved the best performance with a macroaveraged F1 score of 0.424 on depression risk at levels 1, 2, and 3, which represents a new benchmark result on the dataset. The further pretrained language representation models demonstrated improvement over publicly released prediction models.ConclusionsWe applied deep-learning methods with pretrained language representation models to automatically predict depression risk using data from Chinese microblogs. The experimental results showed that the deep-learning methods performed better than previous methods, and have greater potential to discover patients with depression and to trace their mental health conditions.

Highlights

BackgroundMental health is an important component of personal well-being and public health as reported by the World Health Organization (WHO) [1]
Among the three deep-learning methods, bidirectional encoder representations from transformers (BERT) achieved the best performance with a microaveraged F1 score of 0.856
robustly optimized BERT pretraining approach (RoBERTa) achieved the best performance with a macroaveraged F1 score of 0.424 on depression risk at levels 1, 2, and 3, which represents a new benchmark result on the dataset

Summary

Introduction

BackgroundMental health is an important component of personal well-being and public health as reported by the World Health Organization (WHO) [1]. Most diagnoses of depressive illness are based on self-reports or self-diagnosis of patients [9,10]. A high proportion of patients with depression cannot be discovered as they do not want to disclose or discuss their mental health conditions with others. It is not easy to discover patients with depression owing to feelings of shame in disclosing or discussing their mental health conditions with others. As one of the most important daily communication systems, connects large quantities of people, including individuals with depression, and provides a channel to discover patients with depression. We investigated deep-learning methods for depression risk prediction using data from Chinese microblogs, which have potential to discover more patients with depression and to trace their mental health conditions

Methods

Results

Discussion

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: JMIR Medical Informatics	Publication Date: Jul 29, 2020
Citations: 38	License type: cc-by

R Discovery Prime

R Discovery Prime

Depression Risk Prediction for Chinese Microblogs via Deep-Learning Methods: Content Analysis

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: JMIR Medical Informatics

Lead the way for us

Similar Papers

ALBERT with Knowledge Graph Encoder Utilizing Semantic Similarity for Commonsense Question Answering
Byeongmin Choi ... Yeunwoong Kyung
Intelligent Automation & Soft Computing | VOL. 36
Byeongmin Choi, et. al.Byeongmin Choi ... Yeunwoong Kyung
01 Jan 2023
Intelligent Automation & Soft Computing | VOL. 36

Assessing depression risk in Chinese microblogs: a corpus and machine learning methods
Xiaofeng Wang ... Wanting Li
-
Xiaofeng Wang, et. al.Xiaofeng Wang ... Wanting Li
01 Jun 2019
01 Jun 2019

Classification of Fire Related Tweets on Twitter Using Bidirectional Encoder Representations from Transformers (BERT)
Jairus Mingua ... Evan Joy Celino
-
Jairus Mingua, et. al.Jairus Mingua ... Evan Joy Celino
28 Nov 2021
28 Nov 2021

Korean clinical entity recognition from diagnosis text using BERT
Young-Min Kim ... Tae-Hoon Lee
BMC Medical Informatics and Decision Making | VOL. 20
Young-Min Kim, et. al.Young-Min Kim ... Tae-Hoon Lee
01 Sep 2020
BMC Medical Informatics and Decision Making | VOL. 20

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Depression Risk Prediction for Chinese Microblogs via Deep-Learning Methods: Content Analysis

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: JMIR Medical Informatics