Abstract

Person re-identification is to retrieval pedestrian images from no-overlap camera views detected by pedestrian detectors. Most existing person re-identification (re-ID) models often fail to generalize well from the source domain where the models are trained to a new target domain without labels, because of the bias between the source and target domain. This issue significantly limits the scalability and usability of the models in the real world. Providing a labeled source training set and an unlabeled target training set, the aim of this paper is to improve the generalization ability of re-ID models to the target domain. To this end, we propose an image generative network named identity preserving generative adversarial network (IPGAN). The proposed method has two excellent properties: 1) only a single model is employed to translate the labeled images from the source domain to the target camera domains in an unsupervised manner; 2) The identity information of images from the source domain is preserved before and after translation. Furthermore, we propose IBN-reID model for the person re-identification task. It has better generalization ability than baseline models, especially in the cases without any domain adaptation. The IBN-reID model is trained on the translated images by supervised methods. Experimental results on Market-1501 and DukeMTMC-reID show that the images generated by IPGAN are more suitable for cross-domain person re-identification. Very competitive re-ID accuracy is achieved by our method.

Highlights

  • As one of the most challenging problems in the field of computer vision [1]–[6], the person re-identification(re-ID) task aims at matching the relevant images across no-overlap camera views at different locations and times

  • We propose a multi-domain image-toimage style translation approach, termed Identity Preserving Generative Adversarial Network (IPGAN)

  • The inputs of Dsem are the style-translated images from source domain and the identity labels of images from the source domain

Read more

Summary

Introduction

As one of the most challenging problems in the field of computer vision [1]–[6], the person re-identification(re-ID) task aims at matching the relevant images across no-overlap camera views at different locations and times. It plays a crucial role in law enforcement and video surveillance for public safety, such as long-term cross camera tracking and video retrieval. It is expensive and impractical to collect such scale manual labelling in the real-world Those re-ID models often fail to generalize well from a dataset to a new one, because of the feature distribution bias between two different datasets

Objectives
Methods
Results
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.