Real-time masked face classification and head pose estimation for RGB facial image via knowledge distillation

Chien Thai, Viet Tran, Minh Bui, Dat Nguyen, Huong Ninh, Hai Tran

doi:10.1016/j.ins.2022.10.074

Abstract

Human head pose estimation and masked face classification are two essential problems in facial analysis. A compact model that resolves both tasks is needed to reduce the computational cost of deploying face recognition-based applications, such as camera surveillance systems and AI cameras, while maintaining accuracy. In this work, we propose a lightweight multi-task model called MHPNet that simultaneously addresses both head pose estimation and masked face classification. Because datasets labelled for both tasks are lacking, we first train teacher models independently on two labelled datasets, 300W-LPA and MAFA, to extract head pose and masked-face soft labels. We then design an architecture with a ResNet18 backbone and two branches, one per task, and train the proposed model on the joint datasets with the teacher models' predictions via knowledge distillation. To evaluate the effectiveness of our model, we use the AFLW2000 and BIWI datasets for head pose estimation and the MAFA dataset for masked face classification. Experimental results show that our proposed model significantly improves accuracy compared to state-of-the-art head pose estimation methods and achieves remarkable performance on the masked face dataset. Furthermore, our model runs in real time at ∼400 FPS when inferring on a Tesla V100 GPU. Source code and datasets are available at https://github.com/chientv99/maskpose.
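
The abstract does not state the exact training objective, so the following is only a minimal sketch of how a two-branch student could be trained against teacher soft labels. It assumes a mean absolute error term on the teacher's (yaw, pitch, roll) angles and a cross-entropy term on the teacher's mask probabilities. The function names, the choice of loss terms, and the weights `pose_weight` and `mask_weight` are hypothetical and are not taken from the paper or its repository.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def pose_distill_loss(student_pose, teacher_pose):
    """Mean absolute error between student and teacher (yaw, pitch, roll).

    Assumption: the teacher's predicted angles act as the regression target.
    """
    diffs = [abs(s - t) for s, t in zip(student_pose, teacher_pose)]
    return sum(diffs) / len(diffs)


def mask_distill_loss(student_logits, teacher_probs):
    """Cross-entropy of the student's prediction against the teacher's soft label."""
    probs = softmax(student_logits)
    # Clamp probabilities so log() never receives zero.
    return -sum(t * math.log(max(p, 1e-12)) for t, p in zip(teacher_probs, probs))


def multitask_loss(student_pose, teacher_pose, student_logits, teacher_probs,
                   pose_weight=1.0, mask_weight=1.0):
    """Weighted sum of the two distillation terms (weights are hypothetical)."""
    return (pose_weight * pose_distill_loss(student_pose, teacher_pose)
            + mask_weight * mask_distill_loss(student_logits, teacher_probs))


# Example: one face with teacher soft labels from both teachers.
loss = multitask_loss(
    student_pose=[10.0, -5.0, 2.0],   # student (yaw, pitch, roll) in degrees
    teacher_pose=[12.0, -5.0, 1.0],   # head pose teacher output
    student_logits=[0.0, 0.0],        # student logits for (masked, unmasked)
    teacher_probs=[0.5, 0.5],         # mask teacher soft label
)
```

In this sketch both terms are computed for every image, which reflects the idea that teacher predictions can stand in for the labels a given dataset does not provide.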

Full Text