Speech Emotion Recognition using Hybrid Architectures

Michael Norval,Zenghui Wang

doi:10.47839/ijc.23.1.3430

Abstract

The detection of human emotions from speech signals remains a challenging frontier in audio processing and human-computer interaction domains. This study introduces a novel approach to Speech Emotion Recognition (SER) using a Dendritic Layer combined with a Capsule Network (DendCaps). A Convolutional Neural Network (NN) and a Long Short-Time Neural Network (CLSTM) hybrid model are used to create a baseline which is then compared to the DendCap model. Integrating dendritic layers and capsule networks for speech emotion detection can harness the unique advantages of both architectures, potentially leading to more sophisticated and accurate models. Dendritic layers, inspired by the nonlinear processing properties of dendritic trees in biological neurons, can handle the intricate patterns and variabilities inherent in speech signals, while capsule networks, with their dynamic routing mechanisms, are adept at preserving hierarchical spatial relationships within the data, enabling the model to capture more refined emotional subtleties in human speech. The main motivation for using DendCaps is to bridge the gap between the capabilities of biological neural systems and artificial neural networks. This combination aims to capitalize on the hierarchical nature of speech data, where intricate patterns and dependencies can be better captured. Finally, two ensemble methods namely stacking and boosting are used for evaluating the CLSTM and DendCaps networks and the experimental results show that stacking of the CLSTM and DendCaps networks gives the superior result with a 75% accuracy.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Speech Emotion Recognition using Hybrid Architectures

Abstract

Talk to us

Similar Papers

More From: International Journal of Computing

Lead the way for us

Journal: International Journal of Computing	Publication Date: Apr 1, 2024
License type: cc-by

Similar Papers

ANN based decision fusion for speech emotion recognition
Lu Xu ... Mingxing Xu
-
Lu Xu, et. al.Lu Xu ... Mingxing Xu
06 Sep 2009
06 Sep 2009

An Ensemble Model for Multi-Level Speech Emotion Recognition
Chunjun Zheng ... Chunli Wang
Applied Sciences | VOL. 10
Chunjun Zheng, et. al.Chunjun Zheng ... Chunli Wang
26 Dec 2019
Applied Sciences | VOL. 10

Robustness comparison between the capsule network and the convolutional network for facial expression recognition
Donghui Li ... Xingcong Zhao
Applied Intelligence | VOL. 51
Donghui Li, et. al.Donghui Li ... Xingcong Zhao
02 Nov 2020
Applied Intelligence | VOL. 51

Novel dual-channel long short-term memory compressed capsule networks for emotion recognition
Ismail Shahin ... Kemal Polat
Expert Systems with Applications | VOL. 188
Ismail Shahin, et. al.Ismail Shahin ... Kemal Polat
19 Oct 2021
Expert Systems with Applications | VOL. 188

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Speech Emotion Recognition using Hybrid Architectures

Abstract

Talk to us

Similar Papers

More From: International Journal of Computing