Intra-class Information Research Articles

Deep learning models for still-to-video face recognition (FR) typically provide a low level of accuracy because faces captured in unconstrained videos are matched against a reference gallery composed of a single facial still per individual. For improved robustness to intra-class variations, deep Siamese networks have recently been used for pair-wise face matching. Although these networks can improve state-of-the-art accuracy, the absence of prior knowledge from the target domain means that many images must be collected to account for all possible capture conditions, which is not practical for many real-world surveillance applications. In this paper, we propose the deep SiamSRC network, which employs block-sparsity for face matching, while the reference gallery is augmented with a compact set of domain-specific facial images. Prior to deployment, clustering based on row sparsity is performed on unlabelled faces captured in videos from the target domain. Cluster centers discovered in the capture condition space (defined by, e.g., pose, scale and illumination) are used as rendering parameters with an off-the-shelf 3D face model, and a compact set of synthetic faces is thereby generated for each reference still based on representative intra-class information from the target domain. For pair-wise similarity matching with query facial images, SiamSRC exploits sparse representation-based classification with a block structure. Experimental results obtained with videos from the Chokepoint and COX-S2V datasets indicate that the proposed SiamSRC network can outperform state-of-the-art methods for still-to-video FR with a single sample per person, with only a moderate increase in computational complexity.
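
The block-structured matching step mentioned in the abstract can be illustrated with a small, generic sketch. This is not the SiamSRC implementation: the abstract gives no solver, features or parameters, so everything here is an assumption made for illustration, including the function names `block_sparse_code` and `classify`, the penalty weight `lam`, the iteration count, and the toy 3-D "features". Each identity owns one block of gallery atoms (its reference still plus synthetic faces); the query is coded with a group-lasso penalty solved by proximal gradient descent, and the identity whose block alone reconstructs the query best is returned.

```python
import math

def block_sparse_code(D, y, blocks, lam=0.05, iters=500):
    """Minimise 0.5*||y - sum_j x[j]*D[j]||^2 + lam * sum_g ||x_g||_2.

    D      : list of gallery atoms, each a list of floats (hypothetical features)
    y      : query feature vector
    blocks : dict mapping identity label -> list of atom indices in D
    """
    n, m = len(D), len(y)
    # Safe step size: the squared Frobenius norm bounds the Lipschitz constant.
    step = 1.0 / sum(v * v for atom in D for v in atom)
    x = [0.0] * n
    for _ in range(iters):
        # Residual of the current reconstruction.
        r = [sum(x[j] * D[j][i] for j in range(n)) - y[i] for i in range(m)]
        # Gradient step on the least-squares term.
        z = [x[j] - step * sum(D[j][i] * r[i] for i in range(m)) for j in range(n)]
        # Block soft-thresholding: shrinks whole identity blocks toward zero.
        for idx in blocks.values():
            norm = math.sqrt(sum(z[j] ** 2 for j in idx))
            scale = max(0.0, 1.0 - step * lam / norm) if norm > 0 else 0.0
            for j in idx:
                x[j] = z[j] * scale
    return x

def classify(D, y, blocks, **kwargs):
    """Return the identity whose block gives the smallest reconstruction residual."""
    x = block_sparse_code(D, y, blocks, **kwargs)

    def residual(idx):
        recon = [sum(x[j] * D[j][i] for j in idx) for i in range(len(y))]
        return math.sqrt(sum((a - b) ** 2 for a, b in zip(y, recon)))

    return min(blocks, key=lambda label: residual(blocks[label]))

# Toy gallery: two identities, each with a reference atom and one synthetic variant.
gallery = [[1.0, 0.0, 0.0], [0.9, 0.1, 0.0],   # identity "A"
           [0.0, 1.0, 0.0], [0.0, 0.9, 0.1]]   # identity "B"
identity_blocks = {"A": [0, 1], "B": [2, 3]}
print(classify(gallery, [0.95, 0.05, 0.0], identity_blocks))
```

In a real system the atoms would be deep features rather than hand-written vectors, and a dedicated solver would replace the plain loops; the sketch only shows how a block penalty ties the coefficients of one identity together.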

Heterogeneous face recognition (HFR) is still a challenging problem in the computer vision community due to the large appearance difference between the near infrared (NIR) and visible light (VIS) modalities. Recently, breakthroughs have been made in traditional face recognition by applying deep learning to huge numbers of labeled VIS face samples. However, the same deep learning approach cannot simply be applied to the HFR task, because of the large domain difference and the shortage of paired images across modalities for training. In general, the pooling layers of a deep network perform feature reduction but also discard useful face information, which degrades HFR performance. It is therefore important to eliminate modality-related information and retain more facial identity information. In this paper, we propose a novel method called Discriminant Deep Feature Learning Based on Joint Supervision Loss and Multi-layer Feature Fusion (DDFLJM) for the HFR task. In most available CNNs, the softmax loss function is used as the supervision signal to train the deep model. To enhance the discriminative power of the deeply learned features, this paper proposes a new loss function called Scatter Loss (SL), which embeds both inter- and intra-class information for effectively training the deep model. To make full use of the various layers of the deep network, a Dimension Reduction Block (DRB) is designed to extract auxiliary features from multiple mid-level layers. An orthogonality constraint is introduced into the DRB to reduce the spectrum variations between the two modalities. The proposed SL is applied to multiple layers of the network for joint supervision training, which enables these layers to learn discriminative identity features. Moreover, a Modified Gate Two-stream Neural Network (MGTNN) is adopted to fuse the multi-layer features. Extensive experiments are carried out on two challenging NIR-VIS HFR datasets, CASIA NIR-VIS 2.0 and Oulu-CASIA NIR-VIS, demonstrating the superiority of the proposed method.
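
The abstract does not give the formula for Scatter Loss, so the following is only a generic illustration of what "inter- and intra-class information" usually means for a batch of features, not the paper's loss. The function name `scatter_terms` and the toy data are hypothetical. Intra-class scatter is taken as the mean squared distance of each feature to its class center, and inter-class scatter as the size-weighted mean squared distance of each class center to the global mean.

```python
def scatter_terms(features, labels):
    """Return (intra, inter) scatter for a batch of feature vectors.

    features : list of equal-length lists of floats
    labels   : list of class labels, one per feature vector
    """
    dim = len(features[0])

    def mean(vectors):
        return [sum(v[i] for v in vectors) / len(vectors) for i in range(dim)]

    def sqdist(a, b):
        return sum((p - q) ** 2 for p, q in zip(a, b))

    # Group the features by class label.
    by_class = {}
    for f, l in zip(features, labels):
        by_class.setdefault(l, []).append(f)

    centers = {l: mean(v) for l, v in by_class.items()}
    global_mean = mean(features)

    # Intra-class: how far samples sit from their own class center.
    intra = sum(sqdist(f, centers[l]) for f, l in zip(features, labels)) / len(features)
    # Inter-class: how far class centers sit from the global mean, weighted by class size.
    inter = sum(len(by_class[l]) * sqdist(c, global_mean)
                for l, c in centers.items()) / len(features)
    return intra, inter

# Toy batch: two classes, two samples each.
batch = [[0.0, 0.0], [0.0, 2.0], [4.0, 0.0], [4.0, 2.0]]
intra, inter = scatter_terms(batch, [0, 0, 1, 1])
print(intra, inter)
```

A discriminative objective built from these quantities would typically push `intra` down and `inter` up, for example by minimising their ratio or difference alongside the softmax loss; which combination the paper actually uses is not stated in the abstract.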


Articles published on Intra-class Information

FedCCL: Federated dual-clustered feature contrast under domain heterogeneity

Improving deep metric learning via self-distillation and online batch diffusion process

Feature fusion network based on few-shot fine-grained classification.

From Instance to Metric Calibration: A Unified Framework for Open-World Few-Shot Learning.

Local Correlation Ensemble with GCN Based on Attention Features for Cross-domain Person Re-ID

Supervised Regularized Multidimensional Scaling Using Weighted Stress Measure

Multi-view learning with privileged weighted twin support vector machine

Identification of Epileptic EEG Signals Through TSK Transfer Learning Fuzzy System.

Graph-Based Dimensionality Reduction for Hyperspectral Imagery: A Review

Sentiment analysis of Japanese text and vocabulary learning based on natural language processing and SVM

Joint Feature Disentanglement and Hallucination for Few-Shot Image Classification.

ArcVein-Arccosine Center Loss for Finger Vein Verification

SAR Target Recognition via Joint Sparse and Dense Representation of Monogenic Signal

Class Information-Based Band Selection for Hyperspectral Image Classification

Video Face Recognition Using Siamese Networks With Block-Sparsity Matching

Robust Distracter-Resistive Tracker via Learning a Multi-Component Discriminative Dictionary

Weakly supervised segment annotation via expectation kernel density estimation

Discriminant Deep Feature Learning based on joint supervision Loss and Multi-layer Feature Fusion for heterogeneous face recognition

Block kernel nonnegative matrix factorization for face recognition

Feature Learning Using Spatial-Spectral Hypergraph Discriminant Analysis for Hyperspectral Image.
