Autonomous vehicle scenarios often involve occluded and distant pedestrians, which cause missed and false detections or require models too large to deploy. To address these issues, this study proposed a lightweight model based on YOLOv8s. The feature extraction and fusion networks were redesigned, and the detection layers were optimized for better performance. In the backbone network, DualConv and ELAN were combined to create the EDLAN module; together with an optimized SPPF-LSKA, it improved small-scale pedestrian feature extraction in complex backgrounds while reducing parameters and computation. In the neck network, BiFPN and VoVGSCSP enhanced pedestrian features and improved detection. In addition, the WIoU loss function addressed target imbalance to enhance generalization and overall performance. The enhanced YOLOv8s was trained and validated on the CityPersons dataset. Compared with YOLOv8s, it improved precision, recall, F1 score, and mAP@50 by 5.2%, 7.2%, 6.8%, and 6.8%, respectively, while reducing parameters by 68% and compressing model size by 67%. Further validation experiments on the Caltech and BDD100K datasets showed that precision increased by 3.4% and 1.1%, and mAP@50 increased by 7.6% and 2.8%, respectively. The modified model reduces parameters and model size while effectively improving detection accuracy, making it highly valuable for autonomous driving scenarios.
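As a rough illustration (not taken from the paper itself), the DualConv idea underlying the EDLAN module pairs a grouped 3x3 convolution with a pointwise 1x1 convolution applied to the same input, trading full dense convolutions for cheaper parallel branches. The sketch below is a minimal PyTorch version under that assumption; the class name, group count, and activation choice are illustrative, not the authors' exact implementation.

```python
import torch
import torch.nn as nn

class DualConv(nn.Module):
    """Illustrative DualConv-style block: a grouped 3x3 branch (cheap spatial
    filtering) plus a pointwise 1x1 branch (full cross-channel mixing),
    summed before normalization and activation."""

    def __init__(self, in_channels: int, out_channels: int, stride: int = 1, groups: int = 4):
        super().__init__()
        # Grouped 3x3 branch: spatial context at a fraction of the parameter cost.
        self.conv3x3 = nn.Conv2d(in_channels, out_channels, kernel_size=3,
                                 stride=stride, padding=1, groups=groups, bias=False)
        # Pointwise 1x1 branch: mixes information across all channels.
        self.conv1x1 = nn.Conv2d(in_channels, out_channels, kernel_size=1,
                                 stride=stride, bias=False)
        self.bn = nn.BatchNorm2d(out_channels)
        self.act = nn.SiLU()

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.act(self.bn(self.conv3x3(x) + self.conv1x1(x)))


if __name__ == "__main__":
    x = torch.randn(1, 64, 80, 80)        # e.g. a backbone feature map
    block = DualConv(64, 128)             # channel counts must be divisible by `groups`
    print(block(x).shape)                 # torch.Size([1, 128, 80, 80])
```

Summing the two branches keeps the output shape of a standard convolution while cutting parameters, which is consistent with the parameter and model-size reductions reported in the abstract.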