Abstract

Facial expression recognition in the wild is a very challenging task. We describe our work on static and continuous facial expression recognition in the wild. We evaluate the recognition results of gray and color deep features, and explore the fusion of multimodal texture features. For continuous facial expression recognition, we design two temporal–spatial dense scale-invariant feature transform (SIFT) features and combine multimodal features to recognize expressions from image sequences. For static facial expression recognition based on video frames, we extract dense SIFT and several deep convolutional neural network (CNN) features, including features from our proposed CNN architecture. We train linear support vector machine and partial least squares classifiers on these features using the static facial expression in the wild (SFEW) and acted facial expression in the wild (AFEW) datasets, and we propose a fusion network that combines all the extracted features at the decision level. Our final accuracies are 56.32% on the SFEW test set and 50.67% on the AFEW validation set, both much better than the baseline recognition rates of 35.96% and 36.08%.
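The paper's fusion network itself is not reproduced here; as a minimal sketch of decision-level fusion under assumed interfaces, the snippet below trains a linear SVM and a PLS-based classifier per feature type (e.g., dense SIFT, CNN activations) and combines their normalized class scores with a weighted average. All names (`feature_sets`, `weights`, the default of seven expression classes) are illustrative assumptions, and the fixed weighted average stands in for the learned fusion network described in the abstract.

```python
import numpy as np
from sklearn.svm import LinearSVC
from sklearn.cross_decomposition import PLSRegression
from sklearn.preprocessing import StandardScaler

def one_hot(y, n_classes):
    """Encode integer labels as one-hot targets so PLS can act as a classifier."""
    out = np.zeros((len(y), n_classes))
    out[np.arange(len(y)), y] = 1.0
    return out

def softmax(z):
    """Map raw per-class scores to comparable probabilities before fusion."""
    z = z - z.max(axis=1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def fuse_at_decision_level(feature_sets, y_train, test_sets, n_classes=7, weights=None):
    """Train one classifier pair per feature set and fuse their class scores.

    feature_sets / test_sets: lists of (n_samples, n_dims) arrays,
    one entry per feature type (e.g., dense SIFT, CNN activations).
    """
    scores = []
    for X_tr, X_te in zip(feature_sets, test_sets):
        scaler = StandardScaler().fit(X_tr)
        X_tr, X_te = scaler.transform(X_tr), scaler.transform(X_te)

        # Linear SVM scores for this feature type.
        svm = LinearSVC(C=1.0).fit(X_tr, y_train)
        scores.append(softmax(svm.decision_function(X_te)))

        # PLS used as a classifier by regressing one-hot targets.
        pls = PLSRegression(n_components=min(32, X_tr.shape[1]))
        pls.fit(X_tr, one_hot(y_train, n_classes))
        scores.append(softmax(pls.predict(X_te)))

    # Uniform weights unless the caller supplies learned ones.
    weights = weights or [1.0] * len(scores)
    fused = sum(w * s for w, s in zip(weights, scores)) / sum(weights)
    return fused.argmax(axis=1)
```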

Highlights

  • With the development of artificial intelligence and affective computing, facial expression recognition has shown promise in human–computer interfaces, online education, entertainment, intelligent environments, and so on

  • As the application environment shifts to real-world scenarios, methods relying on a single feature type, such as local binary patterns (LBP) [1] or bag of visual words [2], cannot achieve promising results

  • Our experiments show that the new temporal–spatial descriptor, namely scale-invariant feature transform (SIFT)-LBP, has better performance; one plausible construction is sketched below this list
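The highlight above names a SIFT-LBP temporal–spatial descriptor without specifying its construction, which the page does not reproduce. Purely as an illustrative assumption, the sketch below computes an LBP map per frame and then extracts dense SIFT descriptors over that map, so SIFT's gradient statistics are taken on the LBP texture image, with mean pooling over time; this is one plausible reading, not the authors' definition.

```python
import cv2
import numpy as np
from skimage.feature import local_binary_pattern

def dense_keypoints(shape, step=8, size=8.0):
    """Build a fixed grid of keypoints for dense SIFT extraction."""
    h, w = shape
    return [cv2.KeyPoint(float(x), float(y), size)
            for y in range(step, h - step, step)
            for x in range(step, w - step, step)]

def sift_lbp_frame(gray, sift, keypoints):
    """Dense SIFT computed over the LBP map of one grayscale frame."""
    lbp = local_binary_pattern(gray, P=8, R=1, method="uniform")
    # Rescale the LBP codes to an 8-bit image so SIFT can consume it.
    lbp = cv2.normalize(lbp, None, 0, 255, cv2.NORM_MINMAX).astype(np.uint8)
    _, desc = sift.compute(lbp, keypoints)
    return desc  # (n_keypoints, 128)

def sift_lbp_sequence(frames):
    """frames: list of equally sized grayscale uint8 face crops.

    Aggregates per-frame descriptors over time by mean pooling.
    """
    sift = cv2.SIFT_create()
    kps = dense_keypoints(frames[0].shape)
    per_frame = [sift_lbp_frame(f, sift, kps) for f in frames]
    return np.mean(per_frame, axis=0).ravel()
```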


Summary

Introduction

With the development of artificial intelligence and affective computing, facial expression recognition has shown promise in human–computer interfaces, online education, entertainment, intelligent environments, and so on. Much research has been done on data collected in strictly controlled laboratory settings with frontal faces, perfect illumination, and posed expressions. As the application environment shifts to real-world scenarios, methods relying on a single feature type, such as local binary patterns (LBP) [1] or bag of visual words [2], cannot achieve promising results. Unlike in lab-controlled datasets, a face in a real environment can appear at any position in the image, under all sorts of angles and poses. For most automatic facial expression recognition methods, the first step is therefore to locate and extract the face from the whole scene. Some methods, such as mixture of parts (MoPs) [4] and the supervised descent method [5], achieve robust face detection under various head rotations.
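Implementations of MoPs and the supervised descent method are not shown in this summary; as a simple stand-in for the detect-and-crop first step it describes, the sketch below uses OpenCV's bundled frontal-face Haar cascade, which is weaker under large head rotations than the cited detectors but illustrates the interface. The crop size and largest-face heuristic are assumptions.

```python
import cv2

def detect_and_crop_face(img_bgr, out_size=(128, 128)):
    """Locate the largest face in the scene and return a normalized crop.

    Uses OpenCV's Haar cascade as a simple stand-in for the more robust
    detectors (MoPs, supervised descent method) cited in the text.
    """
    gray = cv2.cvtColor(img_bgr, cv2.COLOR_BGR2GRAY)
    cascade = cv2.CascadeClassifier(
        cv2.data.haarcascades + "haarcascade_frontalface_default.xml")
    faces = cascade.detectMultiScale(gray, scaleFactor=1.1, minNeighbors=5)
    if len(faces) == 0:
        return None  # no face found; caller may skip the frame
    # Keep the largest detection, assuming it is the subject of interest.
    x, y, w, h = max(faces, key=lambda r: r[2] * r[3])
    return cv2.resize(gray[y:y + h, x:x + w], out_size)
```

The resulting crop would then feed the feature extractors described above.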


