Abstract

Although great progress has been achieved for general object detection with the advent of deep learning, there are still several limitations when applied to pedestrian detection task. In this article, the authors mainly consider the following three problems. (i) most deep learning-based methods are time-consuming and require large storage space, making them unsuitable for real-world applications; (ii) the scale of person varies greatly, which decreases the pedestrian detection performance; (iii) as pedestrians may be occluded by vehicles or other persons, it makes pedestrian detection more challenging. In response to these limitations, this article proposes a multi-receptive field-based framework for single-shot pedestrian detection. Authors' contributions can be summarised as follows. First, to reduce the computational complexity of the detector, the authors design the multi-receptive pooling pyramid module, which reduces the computation cost of the detector and improves its performance. Next, body parts graphs are built on the learned convolutional neural networks (CNN) features, and a graph CNN is used to mine the relationships of the graph nodes. Finally, experiments are conducted on two public pedestrian detection datasets to demonstrate the effectiveness of the proposed method.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call