Abstract
Facial expressions have been proven to be the most effective way for the brain to recognize human emotions in a variety of contexts. With the exponentially increasing research for emotion detection in recent years, facial expression recognition has become an attractive, hot research topic to identify various basic emotions. Happy emotion is one of such basic emotions with many applications, which is more likely recognized by facial expressions than other emotion measurement instruments (e.g., audio/speech, textual and physiological sensing). Nowadays, most methods have been developed for identifying multiple types of emotions, which aim to achieve the best overall precision for all emotions; it is hard for them to optimize the recognition accuracy for single emotion (e.g., happiness). Only a few methods are designed to recognize single happy emotion captured in the unconstrained videos; however, their limitations lie in that the processing of severe head pose variations has not been considered, and the accuracy is still not satisfied. In this paper, we propose a Happy Emotion Recognition model using the 3D hybrid deep and distance features (HappyER-DDF) method to improve the accuracy by utilizing and extracting two different types of deep visual features. First, we employ a hybrid 3D Inception-ResNet neural network and long-short term memory (LSTM) to extract dynamic spatial-temporal features among sequential frames. Second, we detect facial landmarks’ features and calculate the distance between each facial landmark and a reference point on the face (e.g., nose peak) to capture their changes when a person starts to smile (or laugh). We implement the experiments using both feature-level and decision-level fusion techniques on three unconstrained video datasets. The results demonstrate that our HappyER-DDF method is arguably more accurate than several currently available facial expression models.
Highlights
Emotion recognition has become a hot, attractive research area, which has a wide range of applications
There are several types of emotion recognition systems based on different cues for detecting human emotion states such as facial expression recognition (FER) [8], speech emotion recognition [9], physiological emotion recognition [10]; they can be combined into multimodal systems [11], [12] to detect human emotions
This paper proposes a novel HappyER-DDF method that adopts a hybrid deep neural network to recognize the happy emotions from unconstrained videos
Summary
Emotion recognition has become a hot, attractive research area, which has a wide range of applications. It can be utilized for an emotional understanding of customers in the advertising industry. Lie detection can be eased by facial expression recognition and physiological states in the crime and court domain [1]. It can be useful in diagnosing some diseases like anxiety and Parkinson’s in medical applications [2], [3]. The traditional FER systems were developed to recognize only the facial expressions on the lab-controlled datasets with high accuracy of over 97% when
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.