This study develops a novel statistical system for automatic multi-view face detection and pose estimation. The five-module detection system is based on significant local facial features (or subregions) rather than the entire face. The low- and high-frequency feature information of each subregion of the facial image are extracted and projected onto the eigenspace and residual independent basis space in order to create the corresponding PCA (principal component analysis) projection weight vector and ICA (independent component analysis) coefficient vector, respectively. Therefore, the proposed system has an improved tolerance toward different facial expressions, wide viewing angles, partial occlusions and lighting conditions. Furthermore, either projection weight vectors or coefficient vectors in the PCA or ICA space have divergent distributions and are therefore modeled by using the weighted Gaussian mixture model (GMM) rather than a single Gaussian model. The GMM weights and parameters of the GMM are estimated iteratively using the Expectation-Maximization (EM) algorithm. Face detection is then performed by conducting a likelihood evaluation process based on the estimated joint probability of the weight and coefficient vectors and the corresponding geometric positions of the subregions. The use of subregion position information can reduce the risk of false acceptances. Moreover, simple cascaded rejecter module is employed to exclude 85% of the non-face images in order to enhance the overall system performance. The computational overhead is further reduced by eliminating the requirement for a residual image reconstruction process in the ICA process. Finally, the performance of the proposed system is evaluated using challenging databases. The results not only demonstrate the ability of the system to automatically identify facial images with a high degree of accuracy, but also verify its ability to estimate the fine pose angles with 5° precision and an over 90% accuracy rate.
Read full abstract