Abstract

State-of-the-art face recognition method have achieved satisfactory performance on various face datasets, but limited to high-resolution (HR) faces. In real scenes, with affects from camera quality or distance etc., low quality face images are often obtained in surveillance scenes, which leads to poor performance of general face recognition approaches. In this work, we explored the reasons why low-resolution (LR) images cannot be coped with the conventional models, and proposed the optimization methods. The main contribution of this work includes two aspects: (1) We proposed an adaptive random down-sampling data augmentation method with the corresponding loss function and training strategy. For the network training, we mapped the face images with different resolutions to the same embedding space, and then explicitly separated different classes of LR and gathered the same class of HR and LR. (2) A pooling module based on attention mechanism, area attention pooling, was proposed. After pooling, the fine-grained information of the identity is retained, and the redundant information of the non-identity is removed. We conducted a series of visualization experiments to prove the effectiveness of our method. We evaluated our methods on the real-world LR face dataset Surveillance Cameras Face Database (SCface) and the public face evaluation dataset Labeled Faces in the Wild (LFW). Compared with state-of-the-art methods, the proposed method achieves satisfactory performance at LR images on SCface and LFW dataset.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call