Recognition of Audio Depression Based on Convolutional Neural Network and Generative Antagonism Network Model

Zhiyong Wang, Guangqiang Diao, Lifeng Wang, Longxi Chen

doi:10.1109/access.2020.2998532

Open Access

https://doi.org/10.1109/access.2020.2998532

Journal: IEEE Access	Publication Date: Jan 1, 2020
Citations: 21	License type: CC BY 4.0

Affiliation: Shandong Youth University of Political Science

Abstract

This paper proposes an audio depression recognition method based on a convolutional neural network (CNN) and a generative antagonism network (GAN) model. First, the dataset is preprocessed: long-term mute segments are removed, and the remaining segments are spliced into a new audio file. Then, speech features such as Mel-scale Frequency Cepstral Coefficients (MFCCs), short-term energy, and spectral entropy are extracted using an audio difference normalization algorithm. The extracted feature matrices, which represent the unique attributes of each subject's own voice, form the basis for model training. Next, a depression recognition model, DR AudioNet, is built by combining a CNN and a GAN. With DR AudioNet, the former model is optimized, and recognition and classification are completed using the normalized features of the two segments adjacent to the current audio segment (the one before it and the one after it). Experimental results on the AViD-Corpus and DAIC-WOZ datasets show that the proposed method reduces the depression recognition error compared with other existing methods, and that the RMSE and MAE values obtained on both datasets are better than those of the comparison algorithms by more than 5%.

Highlights

As material living standards have improved, mental health issues have received widespread attention.
We propose a novel deep learning algorithm that combines a convolutional neural network (CNN) and a generative antagonism network (GAN) for Automatic Speech Depression Detection (ASDD).
This paper focuses on these three problems and proposes an audio depression recognition method based on a convolutional neural network and generative antagonism network model.

Summary

INTRODUCTION

As material living standards have improved, mental health issues have received widespread attention. Clinical observations and studies have found a significant correlation between audio characteristics and the degree of depression [4], [5]. Automatic Speech Depression Detection (ASDD) has been a focus of scholars because it is low-cost, easy to collect data for, and non-contact [9], [10]. Compared with traditional machine learning methods, deep learning models can extract high-level semantic features within a neural network framework, which has brought breakthrough progress in recent years. We propose a novel deep learning algorithm that combines a convolutional neural network (CNN) and a generative antagonism network (GAN) for ASDD. This paper focuses on these three problems and proposes an audio depression recognition method based on a convolutional neural network and generative antagonism network model.

RELATED WORK

DATA PREPROCESSING
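
The abstract describes the preprocessing step only briefly: long-term mute segments are removed and the remaining audio is spliced together. A minimal sketch of that idea follows, assuming silence is detected by thresholding short-term energy per frame. The function names, frame length, threshold, and minimum run length are hypothetical choices for illustration and are not taken from the paper.

```python
from typing import List, Sequence


def short_term_energy(frame: Sequence[float]) -> float:
    """Short-term energy of one frame: the sum of squared samples."""
    return sum(x * x for x in frame)


def remove_long_silences(
    signal: Sequence[float],
    frame_len: int = 4,              # assumed frame length in samples
    energy_threshold: float = 1e-4,  # assumed silence threshold
    min_silent_frames: int = 3,      # assumed length of a "long-term" mute run
) -> List[float]:
    """Drop runs of at least `min_silent_frames` silent frames and splice the rest."""
    frames = [signal[i:i + frame_len] for i in range(0, len(signal), frame_len)]
    silent = [short_term_energy(f) < energy_threshold for f in frames]

    kept: List[float] = []
    i = 0
    while i < len(frames):
        if not silent[i]:
            kept.extend(frames[i])
            i += 1
            continue
        # Find the end of the current run of silent frames.
        j = i
        while j < len(frames) and silent[j]:
            j += 1
        # Short pauses are kept; only long mute runs are removed.
        if j - i < min_silent_frames:
            for f in frames[i:j]:
                kept.extend(f)
        i = j
    return kept


# Speech, a long pause (3 frames), speech, a short pause (1 frame), speech.
demo = [0.5] * 4 + [0.0] * 12 + [0.5] * 4 + [0.0] * 4 + [0.5] * 4
cleaned = remove_long_silences(demo)
```

In this toy input the three-frame pause is removed while the one-frame pause survives, so natural short pauses in speech are preserved. A real pipeline would pick the frame length and threshold from the sampling rate and the recording noise floor.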

Calculation of the spectrogram entropy
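
This page does not reproduce the paper's entropy formula, so the sketch below uses the common definition of spectral entropy as an assumption: the power spectrum of a frame is normalized into a probability distribution and its Shannon entropy is taken. The helper names are hypothetical, and a naive DFT is used only to keep the example dependency-free.

```python
import cmath
import math
from typing import List, Sequence


def power_spectrum(frame: Sequence[float]) -> List[float]:
    """Power at the non-negative frequency bins, computed with a naive DFT."""
    n = len(frame)
    spectrum = []
    for k in range(n // 2 + 1):
        coeff = sum(
            x * cmath.exp(-2j * cmath.pi * k * t / n) for t, x in enumerate(frame)
        )
        spectrum.append(abs(coeff) ** 2)
    return spectrum


def spectral_entropy(frame: Sequence[float]) -> float:
    """Shannon entropy (in bits) of the normalized power spectrum of a frame."""
    power = power_spectrum(frame)
    total = sum(power)
    if total == 0.0:
        return 0.0  # an all-zero frame carries no spectral information
    probs = [p / total for p in power]
    return -sum(p * math.log2(p) for p in probs if p > 0.0)


# A pure tone concentrates power in one bin; an impulse spreads it evenly.
tone = [math.sin(2 * math.pi * t / 8) for t in range(8)]
impulse = [1.0] + [0.0] * 7
tone_entropy = spectral_entropy(tone)
impulse_entropy = spectral_entropy(impulse)
```

The tone yields an entropy near zero, while the impulse reaches the maximum for five bins, log2(5). Low entropy indicates a spectrum concentrated in a few bins and high entropy a flat one, which is why the measure is often used to characterize how noise-like a frame is.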

AUDIO DEPRESSION REGRESSION PREDICTION NETWORK

EXPERIMENTAL RESULTS AND ANALYSIS

EVALUATING INDICATOR
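
The abstract reports results as RMSE and MAE between predicted and reference depression scores. The sketch below shows the standard definitions of both metrics. The function names and the example scores are hypothetical and are not results from the paper.

```python
import math
from typing import Sequence


def mae(y_true: Sequence[float], y_pred: Sequence[float]) -> float:
    """Mean absolute error: the average of |prediction - reference|."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(abs(p - t) for t, p in zip(y_true, y_pred)) / len(y_true)


def rmse(y_true: Sequence[float], y_pred: Sequence[float]) -> float:
    """Root mean squared error: the square root of the mean squared difference."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("inputs must be non-empty and of equal length")
    return math.sqrt(sum((p - t) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true))


# Made-up reference and predicted scores, for illustration only.
reference = [10.0, 20.0, 30.0]
predicted = [12.0, 18.0, 33.0]
example_mae = mae(reference, predicted)
example_rmse = rmse(reference, predicted)
```

RMSE penalizes large errors more heavily than MAE because the differences are squared before averaging, so reporting both gives a fuller picture of the prediction error.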

MODEL TRAINING

CONCLUSION
