Abstract

Occlusion is unavoidable in real-world applications of person re-identification (ReID). To alleviate the occlusion problem, this work proposes the detection of the occluded and visible regions of the human body by suppressing the occluded region during feature generation and matching, and enhancing the significance of the visible region. This paper introduces a novel method based on pose-guided spatial attention (PGSA) and activation-based attention (AA) called dual-attention re-identification (DAReID). DAReID consists of a mask branch and a global branch and uses ResNet-50 as the backbone network. The mask branch uses PGSA to obtain the visible and occluded regions of a person and constructs pose guided coarse labels for the occluded region through keypoints of the human body, driving the network to obtain robust local features. The global branch obtains the visual activation levels of different regions through AA, and combines this with human pose information to define weighted local distances(WLD). The WLD learning strategy is applied to drive the network to learn new and more discriminative local features. Experimental results show that DAReID achieves comparable performance on the Market1501, DukeMTMC-reID, and CUHK-03 datasets. And on the Occluded-DukeMTMC dataset, DAReID outperforms the existing methods.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call