Bottom-Up Foreground-Aware Feature Fusion for Practical Person Search

Wenjie Yang,Houjing Huang,Xiaotang Chen,Kaiqi Huang

doi:10.1109/tcsvt.2021.3058668

Abstract

The key to efficient person search is jointly localizing pedestrians and learning discriminative representation for person re-identification (re-ID). Some recently developed models are built with separate detection and re-ID branches on top of shared region feature extraction networks. There are two factors that are detrimental to re-ID feature learning. One is the background information redundancy resulting from the large receptive field of neurons. The other is the body part missing and background clutter caused by inaccurate localization. In this work, a bottom-up fusion (BUF) subnet is proposed to fuse the bounding box features pooled from multiple network stages. With a few parameters introduced, BUF leverages the multi-level features with various sizes of receptive fields to mitigate the background-bias problem. To further suppress the non-pedestrian regions, the newly introduced segmentation head generates a foreground probability map as guidance for the network to focus on the foreground regions. The resulting foreground attention module (FAM) enhances the foreground features. Moreover, for robust feature learning in practical person search, we propose to adaptively smooth the labels of the pedestrian boxes with consideration of the detection quality. Extensive experiments on PRW and CUHK-SYSU validate the effectiveness of the proposals. Our Bottom-Up Foreground-Aware Feature Fusion (BUFF) network with ALS achieves considerable gains over the state-of-the-art on PRW and competitive performance on CUHK-SYSU.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Bottom-Up Foreground-Aware Feature Fusion for Practical Person Search

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Circuits and Systems for Video Technology

Lead the way for us

Journal: IEEE Transactions on Circuits and Systems for Video Technology	Publication Date: Feb 13, 2021
Citations: 10

Similar Papers

Bottom-Up Foreground-Aware Feature Fusion for Person Search
Wenjie Yang ... Dangwei Li
-
Wenjie Yang, et. al.Wenjie Yang ... Dangwei Li
12 Oct 2020
12 Oct 2020

MFEFNet: A Multi-Scale Feature Information Extraction and Fusion Network for Multi-Scale Object Detection in UAV Aerial Images
Liming Zhou ... Yadi Wang
Drones | VOL. 8
Liming Zhou, et. al.Liming Zhou ... Yadi Wang
08 May 2024
Drones | VOL. 8

Discriminative Feature Learning With Consistent Attention Regularization for Person Re-Identification
Sanping Zhou ... Jinjun Wang
-
Sanping Zhou, et. al.Sanping Zhou ... Jinjun Wang
01 Oct 2019
01 Oct 2019

Multi‐scale feature extraction for energy‐efficient object detection in remote sensing images
Di Wu ... Fei Xie
IET Computer Vision | VOL. -
Di Wu, et. al.Di Wu ... Fei Xie
30 Oct 2024
IET Computer Vision | VOL. -

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Bottom-Up Foreground-Aware Feature Fusion for Practical Person Search

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Circuits and Systems for Video Technology