Abstract

As a valuable component of intelligent video surveillance, crowd counting has received lots of attention. In practice, however, crowd counting always suffers from the problem of the scale change of pedestrians. To mitigate this limitation, we propose a novel correlation-attention guided regression network to estimate the number of people, termed CGR-Net. To make the generation process of spatial attention and channel attention independent of each other, we design a parallel channel/spatial-wise attention module (PCSAM) to avoid error accumulation. A pixel-wise assisted attention module (PAAM) is developed for learning crowd uneven distribution on the different image pixels to further enhance the ability of the CGR-Net. Furthermore, we present a new loss function to ensure the effectiveness and performance of the proposed method. Comprehensive experimental results demonstrate that our model delivers enhanced representation and attains state-of-the-art performance.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call