Estimating the Intensity of Facial Expressions Accompanying Feedback Responses in Multiparty Video-Mediated Communication

Ryosuke Ueno,Jie Zeng,Fumio Nihei,Yukiko I Nakano

doi:10.1145/3382507.3418878

Abstract

Providing feedback to a speaker is an essential communication signal for maintaining a conversation. In specific feedback, which indicates the listener's reaction to the speaker?s utterances, the facial expression is an effective modality for conveying the listener's reactions. Moreover, not only the type of facial expressions, but also the degree of intensity of the expressions, may influence the meaning of the specific feedback. In this study, we propose a multimodal deep neural network model that predicts the intensity of facial expressions co-occurring with feedback responses. We focus on multiparty video-mediated communication. In video-mediated communication, close-up frontal face images of each participant are continuously presented on the display; the attention of the participants is more likely to be drawn to the facial expressions. We assume that in such communication, the importance of facial expression in the listeners? feedback responses increases. We collected 33 video-mediated conversations by groups of three people and obtained audio and speech data for each participant. Using the corpus collected as a dataset, we created a deep neural network model that predicts the intensity of 17 types of action units (AUs) co-occurring with the feedback responses. The proposed method employed GRU-based model with attention mechanism for audio, visual, and language modalities. A decoder was trained to produce the intensity values for the 17 AUs frame by frame. In the experiment, unimodal and multimodal models were compared in terms of their performance in predicting salient AUs that characterize facial expression in feedback responses. The results suggest that well-performing models differ depending on the AU categories; audio information was useful for predicting AUs that express happiness, and visual and language information contributes to predicting AUs expressing sadness and disgust.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Estimating the Intensity of Facial Expressions Accompanying Feedback Responses in Multiparty Video-Mediated Communication

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Viewing distance matter to perceived intensity of facial expressions.
Andreas Gerhardsson ... Håkan Fischer
Frontiers in Psychology | VOL. 6
Andreas Gerhardsson, et. al.Andreas Gerhardsson ... Håkan Fischer
02 Jul 2015
Frontiers in Psychology | VOL. 6

Real-time estimation of facial expression intensity
Ka Keung Lee ... Yangsheng Xu
-
Ka Keung Lee, et. al. Ka Keung Lee ... Yangsheng Xu
10 Nov 2003
10 Nov 2003

Automated recognition of spontaneous facial expression in individuals with autism spectrum disorder: parsing response variability
Abigail Bangerter ... Seth Ness
Molecular Autism | VOL. 11
Abigail Bangerter, et. al.Abigail Bangerter ... Seth Ness
11 May 2020
Molecular Autism | VOL. 11

Training machine learning algorithms for automatic facial coding: The role of emotional facial expressions' prototypicality.
Björn Büdenbender ... Tim T A Höfling
PloS one | VOL. 18
Björn Büdenbender, et. al.Björn Büdenbender ... Tim T A Höfling
10 Feb 2023
PloS one | VOL. 18

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Estimating the Intensity of Facial Expressions Accompanying Feedback Responses in Multiparty Video-Mediated Communication

Abstract

Talk to us

Similar Papers