Face attribute prediction using off-the-shelf CNN features

Yang Zhong Yang Zhong,Haibo Li Haibo Li,Josephine Sullivan

doi:10.1109/icb.2016.7550092

Yang Zhong Yang Zhong, Haibo Li Haibo Li + Show 1 more

Open Access

https://doi.org/10.1109/icb.2016.7550092

Copy DOI

Publication Date: Jan 1, 2016
Citations: 81	License type: other-oa

Affiliation: KTH Royal Institute of Technology

Abstract

Predicting attributes from face images in the wild is a challenging computer vision problem. To automatically describe face attributes from face containing images, traditionally one needs to cascade three technical blocks - face localization, facial descriptor construction, and attribute classification - in a pipeline. As a typical classification problem, face attribute prediction has been addressed using deep learning. Current state-of-the-art performance was achieved by using two cascaded Convolutional Neural Networks (CNNs), which were specifically trained to learn face localization and attribute description. In this paper, we experiment with an alternative way of employing the power of deep representations from CNNs. Combining with conventional face localization techniques, we use off-the-shelf architectures trained for face recognition to build facial descriptors. Recognizing that the describable face attributes are diverse, our face descriptors are constructed from different levels of the CNNs for different attributes to best facilitate face attribute prediction. Experiments on two large datasets, LFWA and CelebA, show that our approach is entirely comparable to the state-of-the-art. Our findings not only demonstrate an efficient face attribute prediction approach, but also raise an important question: how to leverage the power of off-the-shelf CNN representations for novel tasks.

Full Text