Adaptive Multimodal Emotion Detection Architecture for Social Robots

Juanpablo Heredia, Edmundo Lopes-Silva, Ana Aguilera, Jose Diaz-Amado, Yudith Cardinale, Irvin Dongo, Wilfredo Graterol

doi:10.1109/access.2022.3149214

Abstract

Emotion recognition is a strategy that social robots use to improve Human-Robot Interaction and to model their social behaviour. Since human emotions can be expressed in different ways (e.g., face, gesture, voice), multimodal approaches are useful to support the recognition process. However, although there exist studies dealing with multimodal emotion recognition for social robots, they still present limitations in the fusion process: their performance drops if one or more modalities are not present or if modalities have different qualities. This is a common situation in social robotics, due to the high variety of the sensory capacities of robots; hence, more flexible multimodal models are needed. In this context, we propose an adaptive and flexible emotion recognition architecture able to work with multiple sources and modalities of information and to manage different levels of data quality and missing data, so that robots can better understand the mood of people in a given environment and adapt their behaviour accordingly. Each modality is analyzed independently, and the partial results are then aggregated with a previously proposed fusion method, called EmbraceNet+, which is adapted and integrated into our proposed framework. We also present an extensive review of state-of-the-art studies dealing with fusion methods for multimodal emotion recognition approaches. We evaluate the performance of our proposed architecture by performing different tests in which several modalities are combined to classify emotions using four categories (i.e., happiness, neutral, sadness, and anger). Results reveal that our approach is able to adapt to the quality and presence of modalities. Furthermore, the results obtained are validated and compared with other similar proposals, showing performance competitive with state-of-the-art models.
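To make the idea of fusing independently analyzed modalities more concrete, here is a minimal sketch of a quality-weighted late fusion that tolerates missing modalities. It is not EmbraceNet+ and not the authors' implementation: it swaps in a plain weighted average, and the names `fuse_late` and `predict`, the per-modality quality weights, and the example probabilities are all assumptions made for illustration. Only the four emotion categories come from the abstract.

```python
from typing import Dict, List, Optional, Tuple

# The four categories named in the abstract.
EMOTIONS = ["happiness", "neutral", "sadness", "anger"]

# Hypothetical per-modality output: (class probabilities, quality weight in [0, 1]),
# or None when the robot has no sensor or no data for that modality.
ModalityOutput = Optional[Tuple[List[float], float]]


def fuse_late(outputs: Dict[str, ModalityOutput]) -> Optional[List[float]]:
    """Quality-weighted average of per-modality class probabilities.

    Missing (None) and zero-quality modalities are skipped, and the weights
    are renormalised over the remaining ones. Returns None if nothing is usable.
    """
    fused = [0.0] * len(EMOTIONS)
    total_weight = 0.0
    for name, output in outputs.items():
        if output is None:
            continue
        probs, quality = output
        if quality <= 0.0:
            continue
        if len(probs) != len(EMOTIONS):
            raise ValueError(f"{name}: expected {len(EMOTIONS)} probabilities")
        for i, p in enumerate(probs):
            fused[i] += quality * p
        total_weight += quality
    if total_weight == 0.0:
        return None
    return [value / total_weight for value in fused]


def predict(outputs: Dict[str, ModalityOutput]) -> Optional[str]:
    """Return the label with the highest fused probability, or None."""
    fused = fuse_late(outputs)
    if fused is None:
        return None
    best = max(range(len(fused)), key=fused.__getitem__)
    return EMOTIONS[best]


# Illustrative values only: a sharp face image, a noisy voice signal, no gesture data.
observations = {
    "face": ([0.70, 0.20, 0.05, 0.05], 0.9),
    "voice": ([0.10, 0.20, 0.60, 0.10], 0.3),
    "gesture": None,
}
print(predict(observations))  # prints "happiness"
```

Because the weights are renormalised over whichever modalities are present, the same function works for a robot with one sensor or several. A learned fusion layer such as the one the paper adopts would replace the fixed weighted average.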

Highlights

In people's social interactions, emotion detection is a natural process that directly affects decision-making and actions during communication.
Although studies exist on multimodal emotion recognition for social robots [7], [17], [18], they still present a limitation in the fusion process: their performance can drop if one or more modalities are not present or if modalities have different qualities. This is a common situation in social robotics, since robots can have a wide variety of sensory capacities and might capture the world through different sources and with different levels of quality; more flexible multimodal models are therefore needed.
We review two groups of late fusion methods: those based on the Multilayer Perceptron (MLP) [33]–[35] and those based on more complex models, such as combinations of Convolutional Neural Networks (CNN), Recurrent Neural Networks (RNN), Long Short-Term Memory (LSTM), and others [36]–[39].
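As a rough illustration of what MLP-based late fusion looks like structurally, the sketch below concatenates the class probabilities produced by two modality-specific classifiers and passes them through one hidden layer. It does not reproduce any of the cited methods. The layer sizes, the helper names (`dense`, `random_layer`, `mlp_fuse`), and the input values are assumptions, and the weights are random and untrained, so the output shows the data flow only, not a meaningful prediction.

```python
import math
import random
from typing import List, Tuple

Layer = Tuple[List[List[float]], List[float]]  # (weight rows, biases)


def dense(x: List[float], layer: Layer) -> List[float]:
    """Fully connected layer: one dot product plus bias per output unit."""
    weights, bias = layer
    return [sum(w * v for w, v in zip(row, x)) + b for row, b in zip(weights, bias)]


def relu(x: List[float]) -> List[float]:
    return [max(0.0, v) for v in x]


def softmax(x: List[float]) -> List[float]:
    m = max(x)  # subtract the max for numerical stability
    exps = [math.exp(v - m) for v in x]
    total = sum(exps)
    return [e / total for e in exps]


def random_layer(rng: random.Random, n_out: int, n_in: int) -> Layer:
    """Untrained placeholder weights; a real system would learn these."""
    weights = [[rng.uniform(-0.5, 0.5) for _ in range(n_in)] for _ in range(n_out)]
    return weights, [0.0] * n_out


def mlp_fuse(modality_probs: List[List[float]], hidden: Layer, out: Layer) -> List[float]:
    """Late fusion: concatenate per-modality outputs, then apply a small MLP."""
    x = [p for probs in modality_probs for p in probs]
    return softmax(dense(relu(dense(x, hidden)), out))


rng = random.Random(0)
hidden = random_layer(rng, n_out=8, n_in=8)  # 2 modalities x 4 classes = 8 inputs
out = random_layer(rng, n_out=4, n_in=8)     # 4 emotion classes

face = [0.70, 0.20, 0.05, 0.05]   # illustrative values only
voice = [0.10, 0.20, 0.60, 0.10]
fused = mlp_fuse([face, voice], hidden, out)
print(fused)
```

Note that a fixed-size concatenation like this has no natural way to handle an absent modality, which is the kind of limitation in the fusion process described above.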

Summary

Introduction

Emotion detection is a natural process that directly affects people’s decision-making and actions during communication. Robots can detect human emotions through visual perception [1], speech [2], nonverbal communication [3], and mutual interaction [4], among other methods. In recent years, new proposals for emotion detection in social robots have become faster and more natural, so that robots can better understand how to communicate with people [5].

Journal: IEEE Access	Publication Date: Jan 1, 2022
Citations: 37	License type: CC BY 4.0

Similar Papers

An Improved Multimodal Dimension Emotion Recognition Based on Different Fusion Methods
Haiyang Su ... Bin Liu
06 Dec 2020

Multimodal Emotion Recognition with Thermal and RGB-D Cameras for Human-Robot Interaction
Chuang Yu ... Adriana Tapus
23 Mar 2020

Multimodal emotion recognition in speech-based interaction using facial expression, body gesture and acoustic analysis
Loic Kessous ... George Caridakis
Journal on Multimodal User Interfaces | VOL. 3
12 Dec 2009

A multimodal fusion emotion recognition method based on multitask learning and attention mechanism
Jinbao Xie ... Yury I Varatnitski
Neurocomputing | VOL. 556
04 Aug 2023
