Spanish MEACorpus 2023: A multimodal speech–text corpus for emotion analysis in Spanish from natural environments

Ronghao Pan,José Antonio García-Díaz,Miguel Ángel Rodríguez-García,Rafel Valencia-García

doi:10.1016/j.csi.2024.103856

Abstract

In human–computer interaction, emotion recognition provides a deeper understanding of the user’s emotions, enabling empathetic and effective responses based on the user’s emotional state. While deep learning models have improved emotion recognition solutions, it is still an active area of research. One important limitation is that most emotion recognition systems use only text as input, ignoring features such as voice intonation. Another limitation is the limited number of datasets available for multimodal emotion recognition. In addition, most published datasets contain emotions that are simulated by professionals and produce limited results in real-world scenarios. In other languages, such as Spanish, hardly any datasets are available. Therefore, our contributions to emotion recognition are as follows. First, we compile and annotate a new corpus for multimodal emotion recognition in Spanish (Spanish MEACorpus 2023), which contains 13.16 h of speech divided into 5129 segments labeled by considering Ekman’s six basic emotions. The dataset is extracted from YouTube videos in natural environments. Second, we explore several deep learning models for emotion recognition using text- and audio-based features. Third, we evaluate different multimodal techniques to build a multimodal recognition system that improves the results of unimodal models, achieving a Macro F1-score of 87.745%, using late fusion with concatenation strategy approach.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Spanish MEACorpus 2023: A multimodal speech–text corpus for emotion analysis in Spanish from natural environments

Abstract

Talk to us

Similar Papers

More From: Computer Standards & Interfaces

Lead the way for us

Journal: Computer Standards & Interfaces	Publication Date: Apr 2, 2024
License type: cc-by

Similar Papers

Cataloging of Happy Facial Affect Using a Radial Basis Function Neural Network
M Nachamai ... Pranti Dutta
-
M Nachamai, et. al.M Nachamai ... Pranti Dutta
01 Jan 2013
01 Jan 2013

A Systematic Review on Emotion Recognition System Using Physiological Signals: Data Acquisition and Methodology
Tawsif K ... Nor Azlina Ab Aziz
Emerging Science Journal | VOL. 6
Tawsif K, et. al.Tawsif K ... Nor Azlina Ab Aziz
17 Aug 2022
Emerging Science Journal | VOL. 6

Context-Aware Emotion Recognition in the Wild Using Spatio-Temporal and Temporal-Pyramid Models
Nhu-Tai Do ... Soonja Yeom
Sensors | VOL. 21
Nhu-Tai Do, et. al.Nhu-Tai Do ... Soonja Yeom
27 Mar 2021
Sensors | VOL. 21

Multimodal emotion recognition (MER) system
Kevin Tang ... Ling Guan
-
Kevin Tang, et. al.Kevin Tang ... Ling Guan
01 May 2014
01 May 2014

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Spanish MEACorpus 2023: A multimodal speech–text corpus for emotion analysis in Spanish from natural environments

Abstract

Talk to us

Similar Papers

More From: Computer Standards &amp; Interfaces

More From: Computer Standards & Interfaces