In cross-modal person re-identification, researchers often employ methods that use visible-modality information to generate both an ‘X’ modality and a grayscale modality, enhancing the accuracy of person re-identification models. The ‘X’ modality is produced by a lightweight network trained in a self-supervised manner on visible images, while the grayscale modality is obtained through a simple linear combination of the three RGB channels of visible images. It can be observed that both the ‘X’ modality and the grayscale modality are derived solely from visible images, so neither establishes a connection between the visible and infrared modalities. This paper therefore proposes an intermediate modality generation module that dynamically produces intermediate-modality representations. By combining information from the visible, infrared, and intermediate modalities, the model is encouraged to capture modality-invariant features with cross-modal consistency. This enables persons of the same identity to exhibit similar feature representations across modalities, thereby mitigating the impact of the distribution differences between the visible and infrared modalities. Additionally, to facilitate the learning of appropriate intermediate-modality representations, a distribution migration strategy is introduced. It guides the intermediate modality to maintain an appropriate distance from both the visible and infrared modalities by optimizing the weights of the loss functions, preventing inadequate feature learning caused by an excessive focus on either modality. Furthermore, a mixed augmentation approach is proposed to further alleviate the disparities among the modalities. By randomly cropping regions of visible (infrared) images and combining them with infrared (visible) images, the generalization ability of the model across heterogeneous modalities is enhanced. Extensive comparative experiments are conducted on the SYSU-MM01 and RegDB datasets, yielding mAP values of 57.2% and 85.82%, respectively. The superior mAP performance on the RegDB dataset compared with most existing methods validates the effectiveness of the proposed approach.
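To make the intermediate modality generation module concrete, the following is a minimal sketch, assuming the generator is a lightweight pair of 1x1 convolutional encoders whose outputs are blended by a learnable weight. The class name `IntermediateModalityGenerator`, the hidden width, and the single mixing parameter are illustrative assumptions, not the paper's exact architecture.

```python
# Hypothetical sketch of an intermediate-modality generator; names and layer
# sizes are assumptions, not the paper's specification.
import torch
import torch.nn as nn


class IntermediateModalityGenerator(nn.Module):
    """Maps a visible/infrared image pair to a shared intermediate modality."""

    def __init__(self, channels: int = 3, hidden: int = 16):
        super().__init__()
        # One lightweight encoder per modality, 1x1 convolutions only.
        self.vis_enc = nn.Sequential(
            nn.Conv2d(channels, hidden, kernel_size=1), nn.ReLU(inplace=True),
            nn.Conv2d(hidden, channels, kernel_size=1))
        self.ir_enc = nn.Sequential(
            nn.Conv2d(channels, hidden, kernel_size=1), nn.ReLU(inplace=True),
            nn.Conv2d(hidden, channels, kernel_size=1))
        # Learnable mixing weight: decides how close the generated modality
        # lies to each source modality and is trained jointly with the backbone.
        self.mix_weight = nn.Parameter(torch.tensor(0.5))

    def forward(self, visible: torch.Tensor, infrared: torch.Tensor) -> torch.Tensor:
        w = torch.sigmoid(self.mix_weight)  # keep the weight in (0, 1)
        return w * self.vis_enc(visible) + (1.0 - w) * self.ir_enc(infrared)


# Usage: generate intermediate-modality images for a paired mini-batch.
gen = IntermediateModalityGenerator()
vis_batch = torch.rand(4, 3, 288, 144)   # visible images
ir_batch = torch.rand(4, 3, 288, 144)    # infrared images (replicated to 3 channels)
intermediate = gen(vis_batch, ir_batch)  # same spatial shape as the inputs
```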
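The distribution migration strategy is described only at a high level above; one plausible reading is that the loss weights are derived from how far the intermediate-modality features have drifted toward either source modality. The sketch below follows that assumption; the function name `migration_weights` and the use of batch-wise modality centres are hypothetical.

```python
# Hypothetical sketch of distance-driven loss weighting for the distribution
# migration strategy; the exact weighting rule in the paper may differ.
import torch
import torch.nn.functional as F


def migration_weights(feat_vis: torch.Tensor, feat_ir: torch.Tensor,
                      feat_mid: torch.Tensor):
    """Return loss weights that grow for whichever modality the intermediate
    representation has drifted away from, keeping it between both."""
    # Modality centres over the mini-batch; features have shape (batch, dim).
    c_vis, c_ir, c_mid = feat_vis.mean(0), feat_ir.mean(0), feat_mid.mean(0)
    d_vis = torch.norm(c_mid - c_vis)  # distance to the visible centre
    d_ir = torch.norm(c_mid - c_ir)    # distance to the infrared centre
    w = torch.softmax(torch.stack([d_vis, d_ir]), dim=0)
    return w[0], w[1]


# Usage: weight two alignment losses so the intermediate modality stays balanced.
feat_vis, feat_ir, feat_mid = torch.rand(8, 512), torch.rand(8, 512), torch.rand(8, 512)
w_vis, w_ir = migration_weights(feat_vis, feat_ir, feat_mid)
loss = w_vis * F.mse_loss(feat_mid, feat_vis) + w_ir * F.mse_loss(feat_mid, feat_ir)
```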
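For the mixed augmentation, a minimal sketch is given below, assuming a CutMix-style crop-and-paste between paired visible and infrared images. The function name `cross_modal_mix` and the region-size range are illustrative choices, not values taken from the paper.

```python
# Hypothetical sketch of cross-modal mixed augmentation; region sizes and the
# function name are assumptions for illustration.
import random
import torch


def cross_modal_mix(src: torch.Tensor, dst: torch.Tensor,
                    min_ratio: float = 0.2, max_ratio: float = 0.5) -> torch.Tensor:
    """Paste a random rectangular region of `src` (e.g. a visible image) onto
    `dst` (e.g. the paired infrared image), or vice versa."""
    _, h, w = dst.shape
    rh = int(h * random.uniform(min_ratio, max_ratio))  # region height
    rw = int(w * random.uniform(min_ratio, max_ratio))  # region width
    top = random.randint(0, h - rh)
    left = random.randint(0, w - rw)
    mixed = dst.clone()
    mixed[:, top:top + rh, left:left + rw] = src[:, top:top + rh, left:left + rw]
    return mixed


# Usage: augment a paired visible/infrared sample in both directions.
visible = torch.rand(3, 288, 144)
infrared = torch.rand(3, 288, 144)
ir_with_vis_patch = cross_modal_mix(visible, infrared)  # infrared image carrying a visible region
vis_with_ir_patch = cross_modal_mix(infrared, visible)  # visible image carrying an infrared region
```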