Abstract
Person re-identification (Re-ID) aims to retrieve images of the same person from a gallery. Transformers have been introduced to the Re-ID task due to their excellent ability to model long-range dependencies. However, owing to the properties of the global attention mechanism, they are less effective than convolutional operations at capturing the discriminative local semantics of pedestrians. To address this issue, we present a Multi-granularity Cross Transformer Network (MCTN) that progressively learns salient features of different local structures in a global context. Specifically, we first utilize a Multi-granularity Convolutional Layer (MCL) to investigate salient pedestrian features at various granularities. On this basis, we propose a Pyramidal Cross Transformer learning layer (PCT), which comprises a pyramidal division of pedestrian feature maps, differentiated feature extraction for different pedestrian parts, and cross attention that explores the local–global relationship of the feature map. It enables effective mining of local information within the global structure from a coarse-to-fine perspective. Furthermore, to enhance the interaction between low-level detailed features and high-level semantic features, a Hierarchical Aggregation Strategy (HAS) is introduced to fuse the features learned by cross attention at different stages. Pedestrian features learned in shallow layers serve as global priors for semantic learning in deep layers. We evaluate our method on four large-scale Re-ID datasets, and the experimental results show that the proposed method outperforms state-of-the-art methods.
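To make the local–global cross-attention idea concrete, below is a minimal NumPy sketch, not the authors' implementation: local part tokens (e.g. horizontal stripes from a pyramidal division) act as queries that attend over global patch tokens of the whole feature map. All names, shapes, and the identity projections are illustrative assumptions.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax.
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def cross_attention(local_tokens, global_tokens, d_k):
    """Local part features (queries) attend to the global feature
    tokens (keys/values), injecting global context into each part.
    Learned Q/K/V projections are omitted (identity) for brevity."""
    # Scaled dot-product scores: (n_local, n_global)
    scores = local_tokens @ global_tokens.T / np.sqrt(d_k)
    attn = softmax(scores, axis=-1)        # each local token's weights over global tokens
    return attn @ global_tokens            # (n_local, d): context-enriched part features

rng = np.random.default_rng(0)
d = 64
local_tokens = rng.standard_normal((4, d))    # e.g. 4 stripe (part) tokens of a pedestrian
global_tokens = rng.standard_normal((16, d))  # e.g. 16 patch tokens of the full image
out = cross_attention(local_tokens, global_tokens, d)
print(out.shape)  # (4, 64)
```

In a coarse-to-fine pyramid, this step would be repeated at each granularity (fewer, larger parts at coarse levels; more, smaller parts at fine levels), with the outputs of different stages fused afterwards.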