Counting with Self-Weighted Multi-Scale Fusion Networks

Xin Xiong, Ying Li, Peng Li, Wenjie Yan, Jie Shen, Wei He

doi:10.1142/s0218001423550078

Abstract

Because object scale varies widely, counting in scenes of different densities is an extremely difficult task. In this paper, we propose a new attention-based, self-weighted multi-scale fusion network named SMFNet, which addresses multi-scale variation and significantly improves crowd counting in surveillance scenes. The proposed SMFNet uses VGG as the backbone network to extract multi-scale features, a self-weighted multi-scale fusion network as the neck to fuse these features, and an atrous spatial pyramid pooling (ASPP) network with ordinary convolution as the head to generate both an attention map and a density map. The attention map highlights crowd regions in the image and thereby contributes to a high-quality density map, while the density map records the crowd distribution. The number of people in the image is obtained by summing the pixel values of the density map. Experiments on three crowd counting datasets and one vehicle counting dataset show that the proposed SMFNet improves on state-of-the-art counting methods.
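The data flow described above can be illustrated with a minimal sketch in plain Python. It is not the authors' implementation: the softmax-based weighting in `fuse_scales`, the elementwise gating in `apply_attention`, and all function names and toy values are assumptions made for illustration. Only the final step, summing the density map to obtain the count, is stated in the abstract.

```python
import math


def softmax(scores):
    """Turn raw per-scale scores into weights that sum to 1."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def fuse_scales(feature_maps, scores):
    """Weighted sum of same-sized 2D maps, one map per scale.

    Assumption: the weights come from a softmax over one score per scale.
    The abstract does not say how SMFNet computes its weights.
    """
    weights = softmax(scores)
    rows, cols = len(feature_maps[0]), len(feature_maps[0][0])
    return [
        [sum(w * fm[r][c] for w, fm in zip(weights, feature_maps))
         for c in range(cols)]
        for r in range(rows)
    ]


def apply_attention(density, attention):
    """Gate a density map elementwise with an attention map (assumed form)."""
    return [
        [d * a for d, a in zip(d_row, a_row)]
        for d_row, a_row in zip(density, attention)
    ]


def count_from_density(density):
    """Estimated count: the sum of all pixel values of the density map."""
    return sum(sum(row) for row in density)


# Toy example: two scales with equal scores, then attention gating.
fused = fuse_scales([[[1.0, 3.0]], [[3.0, 1.0]]], scores=[0.0, 0.0])
gated = apply_attention(fused, [[1.0, 0.0]])
count = count_from_density(gated)
```

In this toy run the attention map zeroes out the second pixel, so only the first pixel of the fused map contributes to the count.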

More From: International Journal of Pattern Recognition and Artificial Intelligence

Similar Papers

Encoder-Decoder Based Convolutional Neural Networks with Multi-Scale-Aware Modules for Crowd Counting
29 Dec 2020

Encoder-Decoder Based Convolutional Neural Networks with Multi-Scale-Aware Modules for Crowd Counting
Pongpisit Thanasutives ... Boonserm Kijsirikul
10 Jan 2021

Multi-level feature fusion based Locality-Constrained Spatial Transformer network for video crowd counting
Yanyan Fang ... Bo Hu
Neurocomputing | VOL. 392
25 Jan 2020

ACCNet: Attention-based Contextual Convolutional Network for Crowd Counting
Yaoying Huang ... Aichun Zhu
06 Nov 2020
