Abstract
In response to the growing number of patients with depression, this paper proposes an artificial intelligence method for identifying depression from voice signals, with the aim of improving the efficiency of diagnosis and treatment. First, a pre-trained model, wav2vec 2.0, is fine-tuned to encode and contextualize speech, yielding high-quality voice features. The model is then applied to the publicly available Distress Analysis Interview Corpus - Wizard of Oz (DAIC-WOZ) dataset. For the binary depression-recognition task, the results show a precision of 93.96%, a recall of 94.87%, and an F1 score of 94.41%, with an overall classification accuracy of 96.48%. For the four-class task evaluating depression severity, all precision rates exceed 92.59%, all recall rates exceed 92.89%, all F1 scores exceed 93.12%, and the overall classification accuracy is 94.80%. These findings indicate that the proposed method effectively improves classification accuracy in scenarios with limited data, performing well in both depression identification and severity evaluation. In the future, this method could serve as a valuable supportive tool for the diagnosis of depression.
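To make the described pipeline concrete, the sketch below shows one common way to fine-tune wav2vec 2.0 as a speech classifier using the Hugging Face Transformers library. It is an illustration under assumptions rather than the authors' implementation: the checkpoint name, clip length, preprocessing, and hyperparameters are placeholders, and only the number of output labels (2 for detection, 4 for severity) follows the abstract.

```python
# Minimal sketch (assumed setup, not the paper's code): fine-tuning wav2vec 2.0
# as a sequence classifier for depression recognition.
import torch
from transformers import Wav2Vec2FeatureExtractor, Wav2Vec2ForSequenceClassification

model_name = "facebook/wav2vec2-base"   # assumed base checkpoint
num_labels = 2                          # 2-class detection; set to 4 for severity grading

extractor = Wav2Vec2FeatureExtractor.from_pretrained(model_name)
model = Wav2Vec2ForSequenceClassification.from_pretrained(
    model_name, num_labels=num_labels
)

# waveform: 1-D mono audio at 16 kHz, e.g. a segment from a DAIC-WOZ interview
waveform = torch.randn(16000 * 5)       # placeholder 5-second clip
inputs = extractor(waveform.numpy(), sampling_rate=16000, return_tensors="pt")

# Inference: the contextualized speech representation is pooled and classified
with torch.no_grad():
    logits = model(**inputs).logits
predicted_class = logits.argmax(dim=-1).item()
```

During fine-tuning, passing labels to the forward call produces a cross-entropy loss that can be optimized with a standard training loop; changing num_labels from 2 to 4 adapts the same setup to the severity-evaluation task.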