Abstract

Most supervised person re-identification methods show their excellent performance, but using labeled datasets is very expensive, which limits its application in practical scenarios. To solve the scalability problem, we propose a Cross-camera Erased Feature Learning (CEFL) framework for unsupervised person re-identification that learns discriminative features from image appearances without manual annotations, where both of the cross-camera global image appearance and the local details are explored. Specifically, for the global appearance, in order to bridge the gap between images with the same identities under different cameras, we generate style-transferred images. The network is trained to classify the original images, the style-transferred images and the negative samples. To learn the partial details of the images, we generate erased images and train the network to pull the similar erased images together and push the dissimilar ones away. In addition, we joint learn the discriminative global and local information to learn a more robust model. Global and erased features are used together in feature learning which are successful conjunction of BFENet. A large number of experiments show the superiority of CEFL in unsupervised pedestrian re-identification.

Highlights

  • Person re-identification is a key technology in urban monitoring, which is used in public places to find the target pedestrian in video or images taken by different cameras [1]

  • camera Erased Feature Learning (CEFL), feature space and farther away from different classes to enhance the discrimination of elements we propose to drawwe similar erased closer erased while pushing features away in

  • This paper proposes an unsupervised Cross-camera Erased Feature Learning (CEFL) model and proves the effectiveness of CEFL in unsupervised person re-identification

Read more

Summary

Introduction

Person re-identification (re-ID) is a key technology in urban monitoring, which is used in public places to find the target pedestrian in video or images taken by different cameras [1]. Most current person re-identification methods require tremendous amount of labeled images, which is very difficult and expensive. It limits the scalability and usability in practical application scenarios. These methods usually adopt unsupervised clustering [12,13,14,15] or generative adversarial networks [7,9] to augment datasets. Another branch of unsupervised learning methods [1,7,8,9,10,11]

Objectives
Methods
Results
Discussion
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.