Abstract

This work addresses the recognition of facial emotion from video sequences using deep learning. The input video is first converted into frames, and face detection is performed on each frame with the Viola–Jones algorithm. Features are then extracted with three established techniques: the modified local directional pattern (M-LDP), spatio-temporal features, and the scale-invariant feature transform (SIFT). The features extracted from all frames of the video are concatenated. To shorten the feature vector, which lowers training complexity and improves recognition performance, optimal feature selection is carried out with the distance-based tunicate swarm algorithm (D-TSA). The selected features are passed to a deep learning model termed the heuristically modified recurrent neural network (HM-RNN), in which the same D-TSA tunes the number of hidden neurons of the recurrent neural network (RNN). Experimental results on a widely used benchmark dataset and on a manually collected dataset show that classification performance improves when spatio-temporal features, SIFT, M-LDP, and optimal feature selection are combined, and that the proposed HM-RNN model outperforms the existing models it was compared against.
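
The concatenation and feature-selection steps described above can be illustrated with a minimal sketch. It is not the authors' implementation: plain random search stands in for D-TSA, a leave-one-out nearest-neighbour score stands in for the HM-RNN classifier, and all function names, the penalty weight, and the toy data are assumptions made for illustration only.

```python
import random


def concatenate_features(per_frame_features):
    """Flatten a list of per-frame feature vectors into one video-level vector."""
    return [value for frame in per_frame_features for value in frame]


def apply_mask(vector, mask):
    """Keep only the entries whose mask bit is 1."""
    return [value for value, keep in zip(vector, mask) if keep]


def loo_1nn_accuracy(samples, labels):
    """Leave-one-out accuracy of a 1-nearest-neighbour classifier."""
    correct = 0
    for i, query in enumerate(samples):
        best_j, best_d = None, float("inf")
        for j, other in enumerate(samples):
            if i == j:
                continue
            d = sum((a - b) ** 2 for a, b in zip(query, other))
            if d < best_d:
                best_j, best_d = j, d
        correct += labels[best_j] == labels[i]
    return correct / len(samples)


def fitness(mask, samples, labels, penalty=0.01):
    """Hypothetical objective: accuracy minus a small cost per kept feature."""
    if not any(mask):
        return float("-inf")  # an empty feature set is never acceptable
    reduced = [apply_mask(sample, mask) for sample in samples]
    return loo_1nn_accuracy(reduced, labels) - penalty * sum(mask) / len(mask)


def select_features(samples, labels, iterations=200, seed=0):
    """Random search over binary masks (a simple stand-in for D-TSA)."""
    rng = random.Random(seed)
    n = len(samples[0])
    best_mask = [1] * n  # start from "keep everything"
    best_score = fitness(best_mask, samples, labels)
    for _ in range(iterations):
        candidate = [rng.randint(0, 1) for _ in range(n)]
        score = fitness(candidate, samples, labels)
        if score > best_score:
            best_mask, best_score = candidate, score
    return best_mask, best_score


# Toy data: 4 "videos", 2 frames each, 2 features per frame.
# The first feature of each frame tracks the class; the second is noise.
videos = [
    [[0.1, 5.0], [0.2, 1.0]],
    [[0.0, 1.0], [0.1, 5.0]],
    [[0.9, 5.0], [1.0, 1.0]],
    [[1.0, 1.0], [0.9, 5.0]],
]
labels = [0, 0, 1, 1]

samples = [concatenate_features(video) for video in videos]
mask, score = select_features(samples, labels)
print("selected mask:", mask, "fitness:", score)
```

In this toy setting the noisy features mislead the classifier when every feature is kept, so a mask that drops them scores higher. A swarm optimiser such as D-TSA would explore the same mask space with a guided search rather than independent random draws.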
