Abstract
Over the past decade, research on improving human–machine communication has focused on recognizing emotions from audio cues. Several effective monolingual, multilingual, and cross-corpus speech emotion recognition (SER) systems have been developed; however, they are limited to recognizing emotions from databases of monolingual discourse, primarily in either the categorical or the dimensional emotion space. For multilingual countries and federations such as India, Russia, and the European Union, these limitations can be problematic. Furthermore, the performance of existing models in environments where diverse languages are mixed remains unclear. To address these issues, we propose a mixed-lingual SER system that considers five diverse languages, including dialect variability. Mixed-lingual corpora are developed from available standard speech emotion databases. Furthermore, we propose a compact feature set that combines a unique set of speech feature functionals with a distinctive set of enhanced perceptual features and modified H-coefficients. Compared with existing large feature sets, the proposed compact feature set is robust and effective for the dual task of recognizing emotions in both multilingual and mixed-lingual SER systems, in both the categorical and dimensional emotion spaces. To overcome skewed recognition performance for certain emotions, a data augmentation method is incorporated into the proposed system. The proposed SER system is also designed to efficiently recognize even extreme emotions such as boredom, disgust, sadness, and surprise in both emotion spaces. Comparison with existing SER systems demonstrates that the proposed system outperforms them.
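To make the abstract's pipeline concrete, the sketch below illustrates one plausible way to build a compact, functional-based feature vector from perceptual speech features and to augment under-represented emotion classes. It is not the authors' exact method: the specific features (13 MFCCs, zero-crossing rate, RMS energy), the mean/std functionals, and the augmentation operations (additive noise, pitch shift) are assumptions for illustration only.

```python
# Illustrative sketch only (assumed feature choices, not the paper's pipeline):
# frame-level perceptual features are summarised by statistical functionals,
# and simple augmentations are generated for skewed emotion classes.
import numpy as np
import librosa


def compact_features(y, sr, n_mfcc=13):
    """Reduce frame-level perceptual features to a fixed-length utterance vector."""
    mfcc = librosa.feature.mfcc(y=y, sr=sr, n_mfcc=n_mfcc)     # perceptual cepstral features
    zcr = librosa.feature.zero_crossing_rate(y)                # simple prosodic correlate
    rms = librosa.feature.rms(y=y)                             # frame energy
    frames = np.vstack([mfcc, zcr, rms])                       # shape: (n_features, n_frames)
    # Statistical functionals (mean, std) collapse variable-length utterances
    # into one compact vector suitable for an emotion classifier.
    return np.concatenate([frames.mean(axis=1), frames.std(axis=1)])


def augment(y, sr, seed=0):
    """Generate simple augmented copies to rebalance under-represented emotions."""
    rng = np.random.default_rng(seed)
    noisy = y + 0.005 * rng.standard_normal(len(y))            # additive white noise
    shifted = librosa.effects.pitch_shift(y, sr=sr, n_steps=2) # shift pitch by 2 semitones
    return [noisy, shifted]


# Usage (hypothetical file name): build feature vectors for training.
# y, sr = librosa.load("utterance.wav", sr=16000)
# x = compact_features(y, sr)
# x_augmented = [compact_features(a, sr) for a in augment(y, sr)]
```

A fixed-length functional representation of this kind is what allows a single compact feature set to be reused across monolingual, multilingual, and mixed-lingual corpora without per-language feature engineering.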