Video Expression Recognition Method Based on Spatiotemporal Recurrent Neural Network and Feature Fusion

Xuan Zhou

doi:10.3745/jips.01.0067

Abstract

Automatically recognizing facial expressions in video sequences is a challenging task because there is little direct correlation between facial features and subjective emotions in video. To overcome the problem, a video facial expression recognition method using spatiotemporal recurrent neural network and feature fusion is proposed. Firstly, the video is preprocessed. Then, the double-layer cascade structure is used to detect a face in a video image. In addition, two deep convolutional neural networks are used to extract the time-domain and airspace facial features in the video. The spatial convolutional neural network is used to extract the spatial information features from each frame of the static expression images in the video. The temporal convolutional neural network is used to extract the dynamic information features from the optical flow information from multiple frames of expression images in the video. A multiplication fusion is performed with the spatiotemporal features learned by the two deep convolutional neural networks. Finally, the fused features are input to the support vector machine to realize the facial expression classification task. The experimental results on cNTERFACE, RML, and AFEW6.0 datasets show that the recognition rates obtained by the proposed method are as high as 88.67%, 70.32%, and 63.84%, respectively. Comparative experiments show that the proposed method obtains higher recognition accuracy than other recently reported methods.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Video Expression Recognition Method Based on Spatiotemporal Recurrent Neural Network and Feature Fusion

Abstract

Talk to us

Similar Papers

More From: Journal of Information Processing Systems

Lead the way for us

Journal: Journal of Information Processing Systems	Publication Date: Apr 1, 2021
Citations: 2

Similar Papers

Video-Based Facial Expression Recognition: A Deep Learning Approach
Jeena Jacob ... J Jeba Sonia
-
Jeena Jacob, et. al.Jeena Jacob ... J Jeba Sonia
31 Aug 2021
31 Aug 2021

Video-Based Facial Expression Recognition using Deep Temporal–Spatial Networks
Xianzhang Pan ... Haibo Zhang
IETE Technical Review | VOL. 37
Xianzhang Pan, et. al.Xianzhang Pan ... Haibo Zhang
25 Jul 2019
IETE Technical Review | VOL. 37

Multiple Trajectory Prediction with Deep Temporal and Spatial Convolutional Neural Networks
Jan Strohbeck ... Vasileios Belagiannis
-
Jan Strohbeck, et. al.Jan Strohbeck ... Vasileios Belagiannis
24 Oct 2020
24 Oct 2020

Temporal convolutional network with soft threshold and contractile self-attention mechanism for remaining useful life prediction of rolling bearings
Hao Ma ... Huaiqian Bao
Measurement Science and Technology | VOL. 35
Hao Ma, et. al.Hao Ma ... Huaiqian Bao
05 Sep 2024
Measurement Science and Technology | VOL. 35

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Video Expression Recognition Method Based on Spatiotemporal Recurrent Neural Network and Feature Fusion

Abstract

Talk to us

Similar Papers

More From: Journal of Information Processing Systems