Abstract

In-the-wild dynamic facial expression recognition (DFER) is a very challenging task, and previous methods based on convolutional neural networks (CNNs), recurrent neural networks (RNNs), or Transformers emphasize the extraction of either short-term or long-term temporal information from facial video sequences. Unlike existing methods, this paper proposes a long short-term perception network (LSTPNet) for dynamic facial expression recognition, which jointly perceives both of these temporal cues to benefit the DFER task. Specifically, we propose a long short-term temporal Transformer (LSTformer) that can effectively perceive both long-term and short-term temporal information. In addition, we introduce a temporal channel excitation (TCE) module, extended from the notable efficient channel attention (ECA) module, to establish temporal attention over intermediate features within the backbone network and obtain more temporally representative features. Experimental results on three benchmark datasets demonstrate the state-of-the-art performance of the proposed LSTPNet. The code will be available at https://github.com/LLFabiann/LSTPNet/.
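To make the TCE idea concrete, the following is a minimal NumPy sketch of ECA-style channel attention applied to a temporal feature sequence. The abstract gives no implementation details, so the shapes, the kernel size, and the toy convolution weights here are all assumptions; ECA's core design, a small 1D convolution across channels with no dimensionality reduction, is what the sketch illustrates.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def temporal_channel_excitation(features, kernel_size=3):
    """ECA-style channel attention over a (T, C) feature sequence.

    features: array of shape (T, C), per-frame channel descriptors
    (hypothetical layout; the paper's actual tensor shapes are unknown).
    """
    T, C = features.shape
    # Aggregate over the temporal axis -> one descriptor per channel (C,)
    pooled = features.mean(axis=0)
    # ECA's key idea: a cheap 1D conv across neighboring channels,
    # with no channel reduction; uniform weights are a toy stand-in
    weights = np.ones(kernel_size) / kernel_size
    pad = kernel_size // 2
    padded = np.pad(pooled, pad, mode="edge")
    conv = np.convolve(padded, weights, mode="valid")  # length C for odd kernels
    gates = sigmoid(conv)               # per-channel attention weights in (0, 1)
    return features * gates[None, :]    # re-weight every frame's channels
```

In a real network the gating would act on backbone feature maps and the convolution weights would be learned; this sketch only shows the attention mechanics.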
