Crowd Counting with Semantic Scene Segmentation in Helicopter Footage.

Gergely Csönde, Takehiro Kashiyama, Yoshihide Sekimoto

doi:10.3390/s20174855

Abstract

Crowd counting neural networks have improved continually in recent years, and their accuracy has reached levels at which further improvement is becoming very difficult. However, these highly accurate networks do not capture deeper semantic information, such as social roles (e.g., student, company worker, or police officer) or location-based roles (e.g., pedestrian, tenant, or construction worker). Some of these roles can be learned from the same set of features that identify an entity as human, whereas others require wider contextual information from the person's surroundings. The primary end goal of developing recognition software is to integrate it into autonomous decision-making systems. Such software must therefore be foolproof; that is, it must have a good semantic understanding of the input. In this study, we focus on counting pedestrians in helicopter footage and introduce a dataset created from helicopter videos for this purpose. We use semantic segmentation to extract the required additional contextual information from the surroundings of an entity, and we demonstrate that pedestrian counting accuracy can be increased in this manner. Furthermore, we show that crowd counting and semantic segmentation can be achieved simultaneously, with comparable or even improved accuracy, by using the same crowd counting neural network for both tasks through hard parameter sharing. The presented method is generic and can be applied to arbitrary crowd density estimation methods. A link to the dataset is available at the end of the paper.

Highlights

With the recent rapid developments in convolutional neural networks (CNNs), many image processing tasks that were very difficult a decade ago have become easier
We focus on indirect methods, specifically density map estimator (DME) CNNs, although the work presented can also be applied to direct methods
Our data exhibited a linear correlation between the average L2/L1 norm ratio of the images and the change in the mean absolute error (MAE) caused by masking
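
The L2/L1 norm ratio mentioned in the last highlight can be illustrated with a minimal sketch. This page does not say which arrays the authors computed the ratio on, so the function below simply takes a flattened list of values; the name `l2_l1_ratio` and the toy inputs are assumptions made for illustration only. For non-negative values the ratio is 1 when all mass sits in one element and falls toward 1/sqrt(n) as the mass spreads evenly over n elements.

```python
import math
from typing import Sequence


def l2_l1_ratio(values: Sequence[float]) -> float:
    """Return ||v||_2 / ||v||_1 for a flattened array of values.

    Hypothetical helper: the source does not specify which array
    (image, density map, mask) the ratio was computed on.
    """
    l1 = sum(abs(v) for v in values)
    if l1 == 0:
        raise ValueError("L1 norm is zero; the ratio is undefined")
    l2 = math.sqrt(sum(v * v for v in values))
    return l2 / l1


# Toy inputs (not data from the paper): same total mass, different spread.
concentrated = [1.0, 0.0, 0.0, 0.0]   # all mass in one element
spread = [0.25, 0.25, 0.25, 0.25]     # mass spread evenly

ratio_concentrated = l2_l1_ratio(concentrated)
ratio_spread = l2_l1_ratio(spread)
```

With these toy inputs the concentrated array gives the larger ratio, which is why the ratio is often read as a rough measure of how concentrated the values are.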

Summary

Introduction

With the recent rapid developments in convolutional neural networks (CNNs), many image processing tasks that were very difficult a decade ago have become easier. The task of image-based crowd counting can be divided into two main categories: direct and indirect methods. In the former case, all individuals are separately identified in the image, following which the total number of humans is obtained by counting those individuals directly. We focus on indirect methods, specifically density map estimator (DME) CNNs, although the work presented can also be applied to direct methods.

Simple Masking with Separate CNNs. We took our most accurate segmentation model, which was MTSM-CAN, and used it to mask the density maps of all three DME networks.
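
The masking step described above can be sketched as follows. This is a minimal illustration, not the authors' implementation: it assumes the segmentation output has already been reduced to a binary mask (1 where pedestrians may be counted, 0 elsewhere) with the same shape as the density map. The function names and the toy values are hypothetical.

```python
from typing import List, Sequence

Grid = List[List[float]]


def mask_density_map(density: Grid, keep: List[List[int]]) -> Grid:
    """Zero out density wherever the binary segmentation mask is 0."""
    same_rows = len(density) == len(keep)
    if not same_rows or any(len(d) != len(k) for d, k in zip(density, keep)):
        raise ValueError("density map and mask must have the same shape")
    return [[d * k for d, k in zip(d_row, k_row)]
            for d_row, k_row in zip(density, keep)]


def count_from_density(density: Grid) -> float:
    """The estimated count is the sum of the density map."""
    return sum(sum(row) for row in density)


def mean_absolute_error(predicted: Sequence[float],
                        actual: Sequence[float]) -> float:
    """MAE between predicted and ground-truth counts over a set of images."""
    if len(predicted) != len(actual) or not actual:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(abs(p - a) for p, a in zip(predicted, actual)) / len(actual)


# Toy 3x3 example (made-up values, not data from the paper).
density = [[0.5, 0.5, 0.0],
           [0.0, 1.0, 0.25],
           [0.0, 0.0, 0.75]]
keep = [[1, 1, 0],
        [0, 1, 0],
        [0, 0, 1]]

masked = mask_density_map(density, keep)
raw_count = count_from_density(density)
masked_count = count_from_density(masked)
```

In this toy case the mask removes the 0.25 of density that falls outside the kept region, so the masked count is lower than the raw count. Comparing `mean_absolute_error` on raw and masked counts over a test set is one way to measure the change in MAE caused by masking.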



Journal: Sensors
Publication Date: Aug 27, 2020
Citations: 4
License type: CC BY 4.0
