Network For Crowd Counting Research Articles

Crowd-counting networks have become the mainstream method to deploy crowd-counting techniques on resource-constrained devices. Significant progress has been made in this field, with many outstanding lightweight models being proposed successively. However, challenges like scare-variation, global feature extraction, and fine-grained head annotation requirements still exist in relevant tasks, necessitating further improvement. In this article, we propose a weakly supervised hybrid lightweight crowd-counting network that integrates the initial layers of GhostNet as the backbone to efficiently extract local features and enrich intermediate features. The incorporation of a modified Swin-Transformer block addresses the need for effective global context information. A Pyramid Pooling Aggregation Module handles the inherent scale variation problem in crowd-counting tasks in a more computation-efficient way. This module, along with the cross-attention module, serves as bridges to promote the feature information flow between local features and global context information. Finally, a simplified regressor module is designed to allow the proposed model with weakly supervised guidance for training to avoid precise location-level annotations, noting that the omission of density map generation makes the proposed network more lightweight. Our results on the UCF-QNRF dataset indicate our model is 8.73% and 12.17% more accurate on MAE and MSE metrics, respectively, than the second-best ARNet, in which the parameters decrease by 4.52%. On the ShanghaiTech A dataset, MAE and MSE drop 1.5% and 3.2%, respectively, compared to the second-best PDDNet. The experimental results for accuracy and inference speed evaluation on some mainstream datasets validate the effective design principle of our model.

Read full abstract

Crowd counting aims to estimate the number, density, and distribution of crowds in an image. The current mainstream approach, based on CNN, has been highly successful. However, CNN is not without its flaws. Its limited receptive field hampers the modeling of global contextual information, and it struggles to effectively handle scale variation and background complexity. In this paper, we propose a Multi-scale Hybrid Attention Network called MHANet to solve crowd counting challenges more effectively. To address the issue of scale variation, we have developed a Multi-scale Aware Module (MAM) that incorporates multiple sets of dilated convolutions with varying dilation rates. The MAM significantly improves the network’s ability to extract information at multiple scales. To tackle the problem of background complexity, we have introduced a Hybrid Attention Module (HAM) that combines spatial attention and channel attention. The HAM effectively directs attention to the crowd region while suppressing background interference, resulting in more accurate counting. MHANet has been extensively experimented on four benchmark datasets and compared against state-of-the-art algorithms. It consistently achieves superior performance in terms of the MAE evaluation metric. MHANet outperforms the current state-of-the-art methods by margins of 1.9%, 5.4%, 0.4%, and 0.8% on the ShanghaiTech Part_A, ShanghaiTech Part_B, UCF-QNRF, and UCF_CC_50 datasets, respectively. Furthermore, a series of ablation experiments targeting MAM and HAM were conducted in this paper, and the experimental results fully demonstrate that MAM and HAM can effectively address the challenges of scale variation and background complexity, ultimately enhancing the accuracy and robustness of the network.

Read full abstract

Network For Crowd Counting Research Articles

Related Topics

Articles published on Network For Crowd Counting

Progressive Crowd Enhancement De-Background Network for crowd counting

TinyCount: an efficient crowd counting network for intelligent surveillance

SDANet: scale-deformation awareness network for crowd counting

CrowdUNet: Segmentation assisted U-shaped crowd counting network

Multi-branch progressive embedding network for crowd counting

A multi-scale fusion and dual attention network for crowd counting

Transformer-CNN hybrid network for crowd counting

An encoder-decoder network for crowd counting based on multi-scale attention mechanism

Double multi-scale feature fusion network for crowd counting

A Weakly Supervised Hybrid Lightweight Network for Efficient Crowd Counting

JMFEEL-Net: a joint multi-scale feature enhancement and lightweight transformer network for crowd counting

Improving MLP-Based Weakly Supervised Crowd-Counting Network via Scale Reasoning and Ranking

Multi-branch Segmentation-guided Attention Network for crowd counting

SFPANet: Separation and fusion pyramid attention network for crowd counting

Dual convolutional neural network for crowd counting

MHANet: Multi-scale hybrid attention network for crowd counting

Deformable channel non‐local network for crowd counting

Context Attention Fusion Network for crowd counting

GTL-ASENet: global to local adaptive spatial encoder network for crowd counting

FPANet: feature pyramid attention network for crowd counting

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Network For Crowd Counting Research Articles

Related Topics

Articles published on Network For Crowd Counting

Progressive Crowd Enhancement De-Background Network for crowd counting

TinyCount: an efficient crowd counting network for intelligent surveillance

SDANet: scale-deformation awareness network for crowd counting

CrowdUNet: Segmentation assisted U-shaped crowd counting network

Multi-branch progressive embedding network for crowd counting

A multi-scale fusion and dual attention network for crowd counting

Transformer-CNN hybrid network for crowd counting

An encoder-decoder network for crowd counting based on multi-scale attention mechanism

Double multi-scale feature fusion network for crowd counting

A Weakly Supervised Hybrid Lightweight Network for Efficient Crowd Counting

JMFEEL-Net: a joint multi-scale feature enhancement and lightweight transformer network for crowd counting

Improving MLP-Based Weakly Supervised Crowd-Counting Network via Scale Reasoning and Ranking

Multi-branch Segmentation-guided Attention Network for crowd counting

SFPANet: Separation and fusion pyramid attention network for crowd counting

Dual convolutional neural network for crowd counting

MHANet: Multi-scale hybrid attention network for crowd counting

Deformable channel non‐local network for crowd counting

Context Attention Fusion Network for crowd counting

GTL-ASENet: global to local adaptive spatial encoder network for crowd counting

FPANet: feature pyramid attention network for crowd counting