CNNs with cross-correlation matching for face recognition in video surveillance using a single training sample per person

Mostafa Parchami,Eric Granger,Saman Bashbaghi

doi:10.1109/avss.2017.8078554

Abstract

In video surveillance, face recognition (FR) systems seek to detect individuals of interest appearing over a distributed network of cameras. Still-to-video FR systems match faces captured in videos under challenging conditions against facial models, often designed using one reference still per individual. Although CNNs can achieve among the highest levels of accuracy in many real-world FR applications, state-of-the-art CNNs that are suitable for still-to-video FR, like trunk-branch ensemble (TBE) CNNs, represent complex solutions for real-time applications. In this paper, an efficient CNN architecture is proposed for accurate still-to-video FR from a single reference still. The CCM-CNN is based on new cross-correlation matching (CCM) and triplet-loss optimization methods that provide discriminant face representations. The matching pipeline exploits a matrix Hadamard product followed by a fully connected layer inspired by adaptive weighted cross-correlation. A triplet-based training approach is proposed to optimize the CCM-CNN parameters such that the inter-class variations are increased, while enhancing robustness to intra-class variations. To further improve robustness, the network is fine-tuned using synthetically-generated faces based on still and videos of non-target individuals. Experiments on videos from the COX Face and Chokepoint datasets indicate that the CCM-CNN can achieve a high level of accuracy that is comparable to TBE-CNN and HaarNet, but with a significantly lower time and memory complexity. It may therefore represent the better trade-off between accuracy and complexity for real-time video surveillance applications.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

CNNs with cross-correlation matching for face recognition in video surveillance using a single training sample per person

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Domain-Specific Face Synthesis for Video Face Recognition From a Single Sample Per Person
Fania Mokhayeri ... Eric Granger
IEEE Transactions on Information Forensics and Security | VOL. 14
Fania Mokhayeri, et. al.Fania Mokhayeri ... Eric Granger
01 Mar 2019
IEEE Transactions on Information Forensics and Security | VOL. 14

Deep Learning Architectures for Face Recognition in Video Surveillance
Saman Bashbaghi ... Mostafa Parchami
-
Saman Bashbaghi, et. al.Saman Bashbaghi ... Mostafa Parchami
27 Feb 2018
27 Feb 2018

Surveillance video face recognition with single sample per person based on 3D modeling and blurring
Xiao Hu ... Zhaowen Li
Neurocomputing | VOL. 235
Xiao Hu, et. al.Xiao Hu ... Zhaowen Li
05 Jan 2017
Neurocomputing | VOL. 235

<title>Toward fast feature adaptation and localization for real-time face recognition systems</title>
Fei Zuo ... Peter H De With
-
Fei Zuo, et. al.Fei Zuo ... Peter H De With
16 Jun 2003
16 Jun 2003

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

CNNs with cross-correlation matching for face recognition in video surveillance using a single training sample per person

Abstract

Talk to us

Similar Papers