Abstract

In the construction of smart cities, facial expression analysis plays a crucial role. It can be used in traffic monitoring systems to alleviate traffic pressure by analyzing the emotional states of drivers and passengers. In smart healthcare, it can provide more precise treatment and services to patients. In social entertainment, it can offer more intelligent and personalized interactions. In summary, affective computing technology will play an increasingly significant role in the development of smart cities. In the task of dynamic facial expression recognition (DFER), analyzing the spatial–temporal features of video sequences has become a common research approach. However, facial expression sequences often contain a large number of neutral and noisy frames, which can increase computational cost and degrade performance. Effectively extracting key frames for spatial–temporal feature analysis is therefore a critical aspect of DFER. To address this issue, we propose a sampling-wise dynamic facial expression recognition method via frame-sequence contrastive learning, called SW-FSCL. SW-FSCL aims to improve the performance of DFER by using intelligent dual-stream sampling strategies and frame-sequence contrastive learning to extract key frames and reduce the impact of neutral and noisy frames. We propose a key frame proposal (KFP) block that analyzes the spatial–temporal features of sequences and calculates weight ratios for key frame extraction. Because long sequences are prone to information loss, we also introduce a temporal aggregation (TA) block to prevent data loss and preserve the integrity of temporal information. Experimental results show that the proposed approach outperforms current state-of-the-art methods on two widely used benchmark datasets (DFEW and FERV39k), and visualization results provide insight into the interpretability of the SW-FSCL method.
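The abstract does not give implementation details, but the KFP idea of calculating weight ratios for key frame extraction can be illustrated with a minimal sketch: score each frame of a clip, normalize the scores into weights, and keep the top-k highest-weight frames. This is an illustrative assumption, not the authors' code; the module name, scorer architecture, and tensor shapes below are hypothetical.

```python
import torch
import torch.nn as nn

class KeyFrameProposalSketch(nn.Module):
    """Hypothetical sketch of weight-based key frame selection (not the paper's KFP block)."""

    def __init__(self, feat_dim: int, num_keep: int):
        super().__init__()
        self.num_keep = num_keep
        # Lightweight scorer mapping each frame feature to a scalar importance score.
        self.scorer = nn.Linear(feat_dim, 1)

    def forward(self, frame_feats: torch.Tensor) -> torch.Tensor:
        # frame_feats: (batch, time, feat_dim) per-frame spatial features.
        weights = torch.softmax(self.scorer(frame_feats).squeeze(-1), dim=1)  # (batch, time)
        # Keep the k highest-weight frames, restoring temporal order afterwards.
        idx = weights.topk(self.num_keep, dim=1).indices.sort(dim=1).values   # (batch, k)
        batch_idx = torch.arange(frame_feats.size(0)).unsqueeze(-1)           # (batch, 1)
        return frame_feats[batch_idx, idx]                                    # (batch, k, feat_dim)

# Usage: select 8 key frames from a 16-frame clip of 512-d features.
kfp = KeyFrameProposalSketch(feat_dim=512, num_keep=8)
key_frames = kfp(torch.randn(2, 16, 512))
print(key_frames.shape)  # torch.Size([2, 8, 512])
```

In a full pipeline, such a selector would feed the retained frames into the downstream spatial–temporal model, which is where the frame-sequence contrastive objective and the TA block described in the abstract would apply.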
