ST-CNN: Spatial-Temporal Convolutional Neural Network for crowd counting in videos

Yunqi Miao,Jungong Han,Yongsheng Gao,Baochang Zhang

doi:10.1016/j.patrec.2019.04.012

Abstract

The task of crowd counting and density maps estimating from videos is challenging due to severe occlusions, scene perspective distortions and diverse crowd distributions. Conventional crowd counting methods via deep learning technique process each video frame independently with no consideration of the intrinsic temporal correlation among neighboring frames, thus making the performance lower than the required level of real-world applications. To overcome this shortcoming, a new end-to-end deep architecture named Spatial-Temporal Convolutional Neural Network (ST-CNN) is proposed, which unifies 2D convolutional neural network (C2D) and 3D convolutional neural network (C3D) to learn spatial-temporal features in the same framework. On top of that, a merging scheme is performed on the resulting density maps, taking advantages of the spatial-temporal information simultaneously for the crowd counting task. Experimental results on two benchmark data sets â Mall dataset and WorldExpo′10 dataset show that our ST-CNN outperforms the state-of-the-art models in terms of mean absolutely error (MAE) and mean squared error (MSE).

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

ST-CNN: Spatial-Temporal Convolutional Neural Network for crowd counting in videos

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition Letters

Lead the way for us

Journal: Pattern Recognition Letters	Publication Date: Apr 16, 2019
Citations: 51

Similar Papers

A survey of crowd counting and density estimation based on convolutional neural network
Zizhu Fan ... Yaowei Wang
Neurocomputing | VOL. 472
Zizhu Fan, et. al.Zizhu Fan ... Yaowei Wang
08 Nov 2021
Neurocomputing | VOL. 472

A survey of recent advances in CNN-based single image crowd counting and density estimation
Vishwanath A Sindagi ... Vishal M Patel
Pattern Recognition Letters | VOL. 107
Vishwanath A Sindagi, et. al.Vishwanath A Sindagi ... Vishal M Patel
17 Jul 2017
Pattern Recognition Letters | VOL. 107

LITERATURE REVIEW ON CROWD COUNTING AND CROWD DENSITY MAPPING METHODOLOGIES
Bhat Sirish Mahadeva ... Chandan C Rao
International Journal of Engineering Applied Sciences and Technology | VOL. 6
Bhat Sirish Mahadeva, et. al.Bhat Sirish Mahadeva ... Chandan C Rao
01 Dec 2021
International Journal of Engineering Applied Sciences and Technology | VOL. 6

Mixture of counting CNNs
Shohei Kumagai ... Kazuhiro Hotta
Machine Vision and Applications | VOL. 29
Shohei Kumagai, et. al.Shohei Kumagai ... Kazuhiro Hotta
02 Jul 2018
Machine Vision and Applications | VOL. 29

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

ST-CNN: Spatial-Temporal Convolutional Neural Network for crowd counting in videos

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition Letters