Speech Emotion Recognition Based on Deep Residual Shrinkage Network

Tian Han,Mingyuan Ren,Zhu Zhang,Changchun Dong,Xiaolin Jiang,Quansheng Zhuang

doi:10.3390/electronics12112512

Abstract

Speech emotion recognition (SER) technology is significant for human–computer interaction, and this paper studies the features and modeling of SER. Mel-spectrogram is introduced and utilized as the feature of speech, and the theory and extraction process of mel-spectrogram are presented in detail. A deep residual shrinkage network with bi-directional gated recurrent unit (DRSN-BiGRU) is proposed in this paper, which is composed of convolution network, residual shrinkage network, bi-directional recurrent unit, and fully-connected network. Through the self-attention mechanism, DRSN-BiGRU can automatically ignore noisy information and improve the ability to learn effective features. Network optimization, verification experiment is carried out in three emotional datasets (CASIA, IEMOCAP, and MELD), and the accuracy of DRSN-BiGRU are 86.03%, 86.07%, and 70.57%, respectively. The results are also analyzed and compared with DCNN-LSTM, CNN-BiLSTM, and DRN-BiGRU, which verified the superior performance of DRSN-BiGRU.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Electronics	Publication Date: Jun 2, 2023
Citations: 11	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Speech Emotion Recognition Based on Deep Residual Shrinkage Network

Abstract

Talk to us

Similar Papers

More From: Electronics

Lead the way for us

Similar Papers

Vehicle logo recognition based on depth residual shrinkage network
Zhaobo Lin
-
Zhaobo LinZhaobo Lin
29 Apr 2023
29 Apr 2023

Research on image classification algorithm based on depth residuals shrinkage network in Commercial Image Library
Jiantao Zhao ... Wenxin Chen
Journal of Physics: Conference Series | VOL. 2010
Jiantao Zhao, et. al.Jiantao Zhao ... Wenxin Chen
01 Sep 2021
Journal of Physics: Conference Series | VOL. 2010

Modulation recognition of communication signals based on deep learning
Jun He ... Pengju Li
-
Jun He, et. al.Jun He ... Pengju Li
17 Dec 2021
17 Dec 2021

Cell Phenotype Classification Using Deep Residual Network and Its Variants
Qicheng Lao ... Thomas Fevens
International Journal of Pattern Recognition and Artificial Intelligence | VOL. 33
Qicheng Lao, et. al.Qicheng Lao ... Thomas Fevens
01 Oct 2019
International Journal of Pattern Recognition and Artificial Intelligence | VOL. 33

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Speech Emotion Recognition Based on Deep Residual Shrinkage Network

Abstract

Talk to us

Similar Papers

More From: Electronics