Abstract

This article proposes a feature extraction method for an integrated face tracking and facial expression recognition in real time video. The method proposed by Viola and Jones [1] is used to detect the face region in the first frame of the video. A rectangular bounding box is fitted over for the face region and the detected face is tracked in the successive frames using the cascaded Support vector machine (SVM) and cascaded Radial basis function neural network (RBFNN). The haar-like features are extracted from the detected face region and they are used to create a cascaded SVM and RBFNN classifiers. Each stage of the SVM classifier and RBFNN classifier rejects the non-face regions and pass the face regions to the next stage in the cascade thereby efficiently tracking the face. The performance of tracking is evaluated using one hour video data. The performance of the cascaded SVM is compared with the cascaded RBFNN. The experiment results show that the proposed cascaded SVM classifier method gives better performance over the RBFNN and also the methods described in the literature using single SVM classifier [2]. While the face is being tracked, features are extracted from the mouth region for expression recognition. The features are modelled using a multi-class SVM. The SVM finds an optimal hyperplane to distinguish different facial expressions with an accuracy of 96.0%.

Highlights

  • The proposed work focuses mainly on recognizing the facial expressions of a user interacting with the computer

  • This paper proposes a method for face tracking and facial expression recognition using cascaded Support vector machine (SVM) and cascaded Radial basis function neural network (RBFNN) classifiers

  • The face is tracked in consecutive frames by classifying the face and non face regions using the cascaded SVM and cascaded RBFNN

Read more

Summary

Introduction

The proposed work focuses mainly on recognizing the facial expressions of a user interacting with the computer. Tracking requires a method for face detection which determines the image location of a face in the first frame of a video sequence. A number of methods have been proposed for face detection which include neural networks [3,4], wavelet basis functions [5] and Bayesian discriminating features [6]. A method is proposed using haar-like features for feature selection and a cascade of SVMs and a cascade of RBFNNs for classification. The detected face and non face regions are modelled using cascaded SVMs and cascaded RBFNNs. The trained models are used to track the face region in the video sequence. In this work a multi-class SVM is used for modelling mouth features and to classify the facial expressions.

Face Detection
Facial Feature Extraction
Support Vector Machines Machine
Radial Basis Function Neural Networks
Modelling of Mouth Region for Expression Classification
Experimental Results
Conclusions and Future Work
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call