Adversarial Decoupling and Modality-Invariant Representation Learning for Visible-Infrared Person Re-Identification

Weipeng Hu, Yanke Hou, Haifeng Hu, Haitang Zeng, Bohong Liu

doi:10.1109/tcsvt.2022.3147813

Abstract

Visible-infrared person re-identification (RGB-IR ReID) has attracted increasing attention due to its surveillance applications in low-light environments. However, the large intra-class variations between different domains remain a challenging issue in computer vision. To address this issue, we propose a novel adversarial Decoupling and Modality-invariant Representation learning (DMiR) method that explores spectrum-invariant yet identity-discriminative representations for cross-modality pedestrians. Our model consists of three key components: Domain-related Representation Disentanglement (DrRD), Modality-invariant Discriminative Representation (MiDR), and Representation Orthogonal Decorrelation (ROD). First, two subnets, Identity-Net and Domain-Net, are designed to extract identity-related features and domain-related features, respectively. Given this two-stream structure, DrRD achieves adversarial decoupling of domain-specific features via a min-max disentanglement process. Specifically, the classification objective function on Domain-Net is minimized to extract spectrum-specific information, while the same objective is maximized to reduce domain-specific information. Second, in Identity-Net, we introduce MiDR to enhance intra-class compactness and reduce domain variations by exploring positive and negative pair variations, semantic-wise differences, and pair-wise semantic variations. Finally, correlation between the two decomposed features, i.e., identity-related features and domain-related features, may introduce modality information into identity representations, and vice versa. Therefore, we present the ROD constraint to make the two decomposed features uncorrelated, which separates the two components more effectively and enhances the feature representations. In practice, we construct ROD at both the feature level and the parameter level, and select feature-level ROD as the decorrelation strategy because of its superior decorrelation performance. The whole scheme disentangles spectrum-dependent information and purifies identity information. Extensive experiments on two mainstream RGB-IR ReID datasets demonstrate the effectiveness of our method.
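
The abstract does not give the loss formulas, so the following is only a minimal sketch of the two ideas it names: a min-max use of a modality classification loss, and a feature-level orthogonality penalty. Every function name here is hypothetical. The sketch assumes that a modality classifier scores both feature streams, and that decorrelation is measured as the mean squared inner product between paired identity and domain feature vectors. Neither assumption is confirmed by the paper.

```python
import math


def cross_entropy(logits, label):
    """Softmax cross-entropy for one sample (numerically stabilised)."""
    m = max(logits)
    log_sum = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_sum - logits[label]


def adversarial_domain_losses(domain_logits, identity_logits, modality):
    """Return (loss for the domain stream, loss for the identity stream).

    Assumption: both streams are scored by a modality classifier
    (label 0 = visible, 1 = infrared). The domain stream minimises the
    classification loss; the identity stream minimises its negative,
    which is the same as maximising it.
    """
    domain_loss = cross_entropy(domain_logits, modality)
    identity_loss = -cross_entropy(identity_logits, modality)
    return domain_loss, identity_loss


def orthogonality_penalty(identity_feats, domain_feats):
    """Mean squared inner product of paired feature vectors.

    Assumed form of a feature-level decorrelation term: it is zero
    exactly when every identity/domain pair is orthogonal.
    """
    total = 0.0
    for f_id, f_dom in zip(identity_feats, domain_feats):
        dot = sum(a * b for a, b in zip(f_id, f_dom))
        total += dot * dot
    return total / len(identity_feats)


# Toy batch of two samples with 2-D features per stream.
id_feats = [[1.0, 0.0], [0.0, 2.0]]
dom_feats = [[0.0, 3.0], [1.0, 1.0]]
print(orthogonality_penalty(id_feats, dom_feats))  # prints 2.0
```

In the toy batch the first pair is already orthogonal and contributes nothing, while the second pair has inner product 2, so the penalty is (0 + 4) / 2 = 2.0. In a real training loop these terms would be weighted and combined with the identity loss; the weights are not stated in the abstract.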

Journal: IEEE Transactions on Circuits and Systems for Video Technology
Publication Date: Aug 1, 2022
Citations: 37

Similar Papers

Improving Domain-Generalized Few-Shot Text Classification with Multi-Level Distributional Signatures
Xuyang Wang ... Yongquan Fan
Applied Sciences | VOL. 13
16 Jan 2023

Fine grained food image recognition based on swin transformer
Zhiyong Xiao ... Zhaohong Deng
Journal of Food Engineering | VOL. 380
16 May 2024

Adversarial Disentanglement Spectrum Variations and Cross-Modality Attention Networks for NIR-VIS Face Recognition
Weipeng Hu ... Haifeng Hu
IEEE Transactions on Multimedia | VOL. 23
17 Mar 2020

Identity–Expression Dual Branch Network for Facial Expression Recognition
Haifeng Zhang ... Zengfu Wang
IEEE Transactions on Cognitive and Developmental Systems | VOL. 13
30 Oct 2020
