Pose-guided Human Feature Aggregation for Occluded Person Re-identification

Zhe Zhang,Zongwen Bai,Meili Zhou

doi:10.62517/jbdc.202301407

Abstract

Since the appearance of most pedestrians is often obscured by various obstacles. Some existing works solve the occlusion problem by aligning the query image of the target pedestrian with the body part of the gallery image, but the body structure of the pedestrian is complicated and not easy to align. Therefore, this paper introduces a Human Feature Aggregation (HFA) approach based on Transformer without alignment, which uses pose information to separate the body parts of target pedestrians from the occlusion. This method utilizes pose information to separate the body parts of the target pedestrian from the obstructions. Initially, the Vision Transformer incorporates Convolutional Neural Network (CNN) advantages to enhance extraction more fine-grained global and local features. Subsequently, the body parts of the target pedestrian are separated from the obstructions using pose information extracted by a pose estimator. Finally, in the human feature aggregation module, local features are matched and fused with pose information to enrich the human features. It steers the model towards focus more on body parts. The experimental findings indicate that the proposed HFA approach surpasses alternative methods on multiple benchmark datasets.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Pose-guided Human Feature Aggregation for Occluded Person Re-identification

Abstract

Talk to us

Similar Papers

More From: Journal of Big Data and Computing

Lead the way for us

Journal: Journal of Big Data and Computing	Publication Date: Dec 1, 2023
License type: CC BY 4.0

Similar Papers

CNN classification based on global and local features
Yufeng Zheng ... Matthias F Carlsohn
-
Yufeng Zheng, et. al.Yufeng Zheng ... Matthias F Carlsohn
14 May 2019
14 May 2019

Joint Coding of Local and Global Deep Features in Videos for Visual Search.
Lin Ding ... Yonghong Tian
IEEE Transactions on Image Processing | VOL. 29
Lin Ding, et. al.Lin Ding ... Yonghong Tian
01 Jan 2020
IEEE Transactions on Image Processing | VOL. 29

Processing global and local features in convolutional neural network (CNN) and primate visual systems
Jun Huang ... Yang Ou
-
Jun Huang, et. al.Jun Huang ... Yang Ou
14 May 2018
14 May 2018

HCNN: A Neural Network Model for Combining Local and Global Features Towards Human-Like Classification
Tielin Zhang ... Bo Xu
International Journal of Pattern Recognition and Artificial Intelligence | VOL. 30
Tielin Zhang, et. al.Tielin Zhang ... Bo Xu
30 Dec 2015
International Journal of Pattern Recognition and Artificial Intelligence | VOL. 30

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Pose-guided Human Feature Aggregation for Occluded Person Re-identification

Abstract

Talk to us

Similar Papers

More From: Journal of Big Data and Computing