Abstract

Unsupervised person re-identification (re-id) is an effective analysis for video surveillance in practice, which can train a pedestrian matching model without any annotations, and it is easy to deploy in unseen camera scenarios. The most challenging problem in unsupervised re-id task is the huge distribution-gap among different camera views, and the intrinsic correlations in unlabeled identities are also complicated to sufficiently explored. This paper proposes a Multi-order Cross-view Graph adversarial Network (MCGN) to bridge the cross-view distribution-gap, and mine the inherent discriminative information by multi-order triplet correlations. Specifically, MCGN firstly exploits graph representations by a cross-view graph convolutional network according to intra-view and inter-view graph structure, and then encodes each pedestrian image into a view-shared feature space, which is iteratively trained by a graph generative adversarial learning strategy to deeply bridge the distribution-gap. Finally, this paper proposes a multi-order discriminative learning module for composing reasonable triplet samples according to multi-order similarity correlations among unlabeled pedestrian images. Furthermore, sufficient experiments are conducted in two large scale person re-id datasets (Market-1501 and DukeMTMC-reID). The comparison to state-of-the-art methods and ablation study demonstrate the superiority of MCGN and the contribution of each module proposed in this paper.

Highlights

  • Person re-identification is one of significant technologies for intelligent analysis of video surveillance

  • For the purpose of effectively bridging cross-view gaps and exploiting multi-order correlations, this paper develops a novel unsupervised re-id framework of M ulti-order Cross-view Graph adversarial N etwork (MCGN), which contains three modules for unsupervised re-identification

  • VOLUME 9, 2021 the view-inference; After learning cross-view graph representations, this paper introduces a multi-order discriminative feature learning module to guide Multi-order Cross-view Graph adversarial Network (MCGN) for mining negative and positive correlations following multiple similarity orders, pulling similar samples together and pushing dissimilar ones away

Read more

Summary

Introduction

Person re-identification (re-id) is one of significant technologies for intelligent analysis of video surveillance. Supervised re-id models require a great deal of annotated pedestrian cross-view data to provide training cues, that severely confines the flexibility of practical re-id process [32]. The major drawbacks of supervised re-id models are summarized into two aspects: (1) marking sufficient training data among non-overlapped camera views is so tremendous that leads sweatful to human, which is easy to produce unreliable pedestrian annotations; (2) Identifying pedestrian identities across cameras will cost expensive manual labor, economical spending and time, because cross-view variations are extremely untoward to overcome. The supervised re-id methods will lose effectiveness and produce poor performance on a unseen scenario. In other words, they are incapable to conduct re-id on a great

Objectives
Methods
Findings
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call