Abstract

The recognition of dynamic facial expressions has received increasing attention because dynamic sequences better reflect the real temporal process of emotion than a static image. However, factors such as subtle expression variations, pose, occlusion, and illumination make it challenging to obtain discriminative expression features for dynamic facial expression recognition. Traditional CNN-based deep networks lack a global and temporal contextual understanding of expressions, which degrades the final recognition of dynamic expressions. We therefore propose an enhanced spatial–temporal learning network (ESTLNet) for more robust dynamic facial expression recognition, consisting of a spatial fusion learning module (SFLM) and a temporal transformer enhancement module (TTEM). First, the SFLM obtains a more expressive spatial feature representation through dual-channel feature fusion learning. Then, the TTEM extracts more informative temporal contextual expression features from these spatial features using an encoder built by cascading a self-attention learning network with an effective gated feed-forward network. Finally, the jointly enhanced spatial–temporal model is evaluated on four widely used dynamic expression datasets (DFEW, AFEW, CK+, and Oulu-CASIA). Extensive experimental results demonstrate that our approach surpasses several existing state-of-the-art methods with notable performance gains.
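To make the TTEM encoder structure described above concrete, the following PyTorch-style sketch shows a temporal self-attention layer cascaded with a gated feed-forward network over per-frame features. This is a hedged reconstruction from the abstract alone: the module names, dimensions, and the GLU-style elementwise gate are our assumptions, not the authors' exact implementation.

```python
# Minimal sketch of a TTEM-style encoder block: multi-head self-attention
# over the frame axis, cascaded with a gated feed-forward network.
# All names, dimensions, and the GLU-style gate are assumptions,
# not the paper's exact design.
import torch
import torch.nn as nn


class GatedFeedForward(nn.Module):
    """Feed-forward network whose hidden activation is modulated
    by a learned elementwise gate (GLU-style)."""

    def __init__(self, dim: int, hidden_dim: int):
        super().__init__()
        self.value = nn.Linear(dim, hidden_dim)
        self.gate = nn.Linear(dim, hidden_dim)
        self.out = nn.Linear(hidden_dim, dim)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Gate the hidden representation elementwise before projecting back.
        return self.out(torch.sigmoid(self.gate(x)) * self.value(x))


class TemporalEncoderBlock(nn.Module):
    """One encoder block: temporal self-attention followed by the gated
    feed-forward network, each wrapped in a residual connection."""

    def __init__(self, dim: int = 512, heads: int = 8, ff_mult: int = 4):
        super().__init__()
        self.norm1 = nn.LayerNorm(dim)
        self.attn = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.norm2 = nn.LayerNorm(dim)
        self.ffn = GatedFeedForward(dim, ff_mult * dim)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, num_frames, dim) -- per-frame spatial features,
        # e.g. the output of an SFLM-like spatial module.
        h = self.norm1(x)
        attn_out, _ = self.attn(h, h, h)
        x = x + attn_out                 # residual over self-attention
        x = x + self.ffn(self.norm2(x))  # residual over gated feed-forward
        return x


if __name__ == "__main__":
    frames = torch.randn(2, 16, 512)  # 2 clips, 16 frames, 512-d features
    block = TemporalEncoderBlock()
    print(block(frames).shape)  # torch.Size([2, 16, 512])
```

The gate here lets the network suppress uninformative hidden units per frame, which is one plausible reading of an "effective gated feed-forward network"; the paper's actual gating formulation may differ.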
