Redesigned Skip-Network for Crowd Counting with Dilated Convolution and Backward Connection.

Sorn Sooksatra,Atsuo Yoshitaka,Toshiaki Kondo,Pished Bunnun

doi:10.3390/jimaging6050028

Abstract

Crowd counting is a challenging task dealing with the variation of an object scale and a crowd density. Existing works have emphasized on skip connections by integrating shallower layers with deeper layers, where each layer extracts features in a different object scale and crowd density. However, only high-level features are emphasized while ignoring low-level features. This paper proposes an estimation network by passing high-level features to shallow layers and emphasizing its low-level feature. Since an estimation network is a hierarchical network, a high-level feature is also emphasized by an improved low-level feature. Our estimation network consists of two identical networks for extracting a high-level feature and estimating the final result. To preserve semantic information, dilated convolution is employed without resizing the feature map. Our method was tested in three datasets for counting humans and vehicles in a crowd image. The counting performance is evaluated by mean absolute error and root mean squared error indicating the accuracy and robustness of an estimation network, respectively. The experimental result shows that our network outperforms other related works in a high crowd density and is effective for reducing over-counting error in the overall case.

Highlights

In recent years, crowd counting has an important role in a wide range of applications
An actual density map derived from Equation (2) was calculated as the ground truth of an estimation network
These connections help to improve the quality of feature maps in shallow layers by inserting the high-level features from deeper layers

Summary

Introduction

Crowd counting has an important role in a wide range of applications (e.g., video surveillance, traffic monitoring [1], public safety, urban planning [2], and so on). To solve the scaling problem, estimation networks from previous studies have trended to implement with several image resolutions and convolutional layers (Conv layers) for handling various object scales Their valuable information could be lost by resizing crowd images and their results were limited by the network configuration in different datasets. Skip connections have been added to the estimation network for reusing feature maps, rather than building other networks This technique is called ‘skip-network’ which is effective only on objects with large scales because high-level feature maps are only emphasized. The proposed network utilized the backward connection for passing high-level features from deeper layers to shallower layers and reducing false positives in crowd counting. A general flowchart of crowd counting using deep learning-based technique

Related Work

Skip-Network

Multi-Scale Network

Density Map for Object Counting

Estimation Network Architecture

Backward Connection

Training Method

Data Augmentation

Experimental Results

Evaluation Metric

Crowd Counting Dataset

TRANCOS Dataset

Ablations on Shanghaitech Part A

Counting Evaluation and Comparison

Shanghaitech Dataset

Density Map Assessment

Crowd Image Quality

Effect on Dilated Convolution

Conclusions and Future Work

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Journal of Imaging	Publication Date: May 2, 2020
Citations: 2	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Redesigned Skip-Network for Crowd Counting with Dilated Convolution and Backward Connection.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Journal of Imaging

Lead the way for us

Similar Papers

A High-Density Crowd Counting Method Based on Convolutional Feature Fusion
Hongling Luo ... Weiqun Wu
Applied Sciences | VOL. 8
Hongling Luo, et. al.Hongling Luo ... Weiqun Wu
23 Nov 2018
Applied Sciences | VOL. 8

Deep Saliency with Encoded Low Level Distance Map and High Level Features
Gayoung Lee ... Junmo Kim
-
Gayoung Lee, et. al.Gayoung Lee ... Junmo Kim
01 Jun 2016
01 Jun 2016

Generating high quality crowd density map based on perceptual loss
Zheyi Fan ... Yixuan Zhu
Applied Intelligence | VOL. 50
Zheyi Fan, et. al.Zheyi Fan ... Yixuan Zhu
17 Dec 2019
Applied Intelligence | VOL. 50

Multi‐level feature fusion network for crowd counting
Luyang Wang ... Yun Li
IET Computer Vision | VOL. 15
Luyang Wang, et. al.Luyang Wang ... Yun Li
01 Feb 2021
IET Computer Vision | VOL. 15

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Redesigned Skip-Network for Crowd Counting with Dilated Convolution and Backward Connection.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Journal of Imaging