CMPNet: A cross-modal multi-scale perception network for RGB-T crowd counting

Shihui Zhang, Kun Chen, Gangzheng Zhai, He Li, Shaojie Han

doi:10.1016/j.future.2024.107596

Abstract

Cross-modal crowd counting methods adapt better to complex scenes by introducing independent supplementary information. However, existing methods still suffer from insufficient fusion of modal features, underutilization of crowd structure, and neglect of scale information. To address these issues, this paper proposes a cross-modal multi-scale perception network (CMPNet). CMPNet mainly consists of a cross-modal perception fusion module and a multi-scale feature aggregation module. The cross-modal perception fusion module suppresses noise features while sharing features between modalities, thereby significantly improving the robustness of the crowd counting process. The multi-scale feature aggregation module obtains rich crowd structure information through a spatial-context-aware graph convolution unit, and then integrates feature information from different scales to enhance the network’s ability to perceive crowd density. To the best of our knowledge, CMPNet is the first attempt to model crowd structure and mine its semantics in the field of cross-modal crowd counting. The experimental results show that CMPNet achieves state-of-the-art performance on all RGB-T datasets, providing an effective solution for cross-modal crowd counting. We will release the code at https://github.com/KunChenKKK/CMPNet.
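The abstract names two ideas, fusing RGB and thermal features and aggregating features across scales, without giving their internals. The sketch below is only a toy illustration of those two ideas in plain Python and is not the authors' implementation. Every function name is hypothetical, the gate is a fixed sigmoid rather than a learned one, and the graph convolution unit is omitted entirely.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def gated_fusion(rgb, thermal):
    """Blend two equally sized 2D feature maps position by position.

    Assumption: the gate is a fixed sigmoid of the feature difference.
    A trained network would learn its gate instead.
    """
    fused = []
    for r_row, t_row in zip(rgb, thermal):
        row = []
        for r, t in zip(r_row, t_row):
            g = sigmoid(r - t)  # closer to 1 when the RGB response is larger
            row.append(g * r + (1.0 - g) * t)
        fused.append(row)
    return fused


def avg_pool(fmap, k):
    """Non-overlapping k x k average pooling (height and width divisible by k)."""
    h, w = len(fmap), len(fmap[0])
    return [
        [
            sum(fmap[i + di][j + dj] for di in range(k) for dj in range(k)) / (k * k)
            for j in range(0, w, k)
        ]
        for i in range(0, h, k)
    ]


def upsample(fmap, k):
    """Nearest-neighbour upsampling by an integer factor k."""
    return [[v for v in row for _ in range(k)] for row in fmap for _ in range(k)]


def multi_scale_aggregate(fmap, scales=(1, 2, 4)):
    """Average the map with coarser, pooled-then-upsampled copies of itself."""
    h, w = len(fmap), len(fmap[0])
    acc = [[0.0] * w for _ in range(h)]
    for k in scales:
        level = upsample(avg_pool(fmap, k), k)
        for i in range(h):
            for j in range(w):
                acc[i][j] += level[i][j] / len(scales)
    return acc


# Toy 4 x 4 inputs standing in for per-modality feature maps.
rgb = [[0.0, 1.0, 2.0, 3.0] for _ in range(4)]
thermal = [[3.0, 2.0, 1.0, 0.0] for _ in range(4)]
fused = gated_fusion(rgb, thermal)
aggregated = multi_scale_aggregate(fused)
```

In this toy version, each fused value lies between the two input values at that position, and the aggregation step leaves the sum over the map unchanged because average pooling followed by nearest-neighbour upsampling redistributes values within each block without adding or removing mass.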

More From: Future Generation Computer Systems

Similar Papers

A Multi-Scale Feature Fusion Network With Cascaded Supervision for Cross-Scene Crowd Counting
Xinfeng Zhang ... Wencong Shan
IEEE Transactions on Instrumentation and Measurement | VOL. 72
01 Jan 2023

Crowd counting via Multi-Scale Adversarial Convolutional Neural Networks
Liping Zhu ... Chengyang Li
Journal of Intelligent Systems | VOL. 30
08 Jul 2020

A multi-scale and multi-level feature aggregation network for crowd counting
Fushun Zhu ... Zhengyu Zhang
Neurocomputing | VOL. 423
21 Oct 2020

Multi-scale Attention Recalibration Network for crowd counting
Jinyang Xie ... Hong Liu
Applied Soft Computing | VOL. 117
19 Jan 2022
