Abstract

Abstract. High-resolution aerial imagery can provide detailed and in some cases even real-time information about traffic related objects. Vehicle localization and counting using aerial imagery play an important role in a broad range of applications. Recently, convolutional neural networks (CNNs) with atrous convolution layers have shown better performance for semantic segmentation compared to conventional convolutional aproaches. In this work, we propose a joint vehicle segmentation and counting method based on atrous convolutional layers. This method uses a multi-task loss function to simultaneously reduce pixel-wise segmentation and vehicle counting errors. In addition, the rectangular shapes of vehicle segmentations are refined using morphological operations. In order to evaluate the proposed methodology, we apply it to the public “DLR 3K” benchmark dataset which contains aerial images with a ground sampling distance of 13 cm. Results show that our proposed method reaches 81.58 % mean intersection over union in vehicle segmentation and shows an accuracy of 91.12 % in vehicle counting, outperforming the baselines.

Highlights

  • Vehicle segmentation and counting in aerial imagery is of significant importance, as aerial imagery can provide valuable information over a large area in a short period of time

  • We investigate the effect of atrous convolutions in improving current convolutional neural networks (CNNs) for semantic segmentation of vehicles in aerial images

  • We evaluate our method on the “DLR 3K” dataset (Liu and Máttyus, 2015) which contains high-resolution aerial images with a ground sampling distances (GSDs) of 13 cm taken over the city of Munich, Germany

Read more

Summary

Introduction

Vehicle segmentation and counting in aerial imagery is of significant importance, as aerial imagery can provide valuable information over a large area in a short period of time. The automatic analysis of such images to segment and count vehicles can yield valuable information for multiple applications such as traffic monitoring, parking lot detection and utilization, and urban management. Thanks to the higher resolution, it is feasible to distinguish vehicles from each other. This is crucial in applications as the number of vehicles provides valuable insights over the captured area. Moving objects in airborne images, especially vehicles, appear to be of small size (e.g., 10 × 20 px) depending on the ground sampling distance

Objectives
Methods
Findings
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call