“Stream loss”: ConvNet learning for face verification using unlabeled videos in the wild

Elaheh Rashedi,Elaheh Barati,Matthew Nokleby,Xue-Wen Chen

doi:10.1016/j.neucom.2018.10.041

Elaheh Rashedi, Elaheh Barati + Show 2 more

Open Access

https://doi.org/10.1016/j.neucom.2018.10.041

Copy DOI

Journal: Neurocomputing	Publication Date: Oct 24, 2018
Citations: 8	License type: publisher-specific-oa

Affiliation: Wayne State University

Abstract

Face recognition tasks have seen a significantly improved performance due to ConvNets. However, less attention has been given to face verification from videos. This paper makes two contributions along these lines. First, we propose a method, called stream loss, for learning ConvNets using unlabeled videos in the wild. Second, we present an approach for generating a face verification dataset from videos in which the labeled streams can be created automatically without human annotation intervention. Using this approach, we have assembled a widely scalable dataset, FaceSequence, which includes 1.5M streams capturing ∼ 500K individuals. Using this dataset, we trained our network to minimize the stream loss. The network achieves accuracy comparable to the state-of-the-art on the LFW and YTF datasets with much smaller model complexity. We also fine-tuned the network using the IJB-A dataset. The validation results show competitive accuracy compared with the best previous video face verification results.

Full Text