Abstract

Object detection from satellite images has been a challenging problem for many years. With the development of effective deep learning algorithms and advancement in hardware systems, higher accuracies have been achieved in the detection of various objects from very high-resolution (VHR) satellite images. This article provides a comparative evaluation of the state-of-the-art convolutional neural network (CNN)-based object detection models, which are Faster R-CNN, Single Shot Multi-box Detector (SSD), and You Look Only Once-v3 (YOLO-v3), to cope with the limited number of labeled data and to automatically detect airplanes in VHR satellite images. Data augmentation with rotation, rescaling, and cropping was applied on the test images to artificially increase the number of training data from satellite images. Moreover, a non-maximum suppression algorithm (NMS) was introduced at the end of the SSD and YOLO-v3 flows to get rid of the multiple detection occurrences near each detected object in the overlapping areas. The trained networks were applied to five independent VHR test images that cover airports and their surroundings to evaluate their performance objectively. Accuracy assessment results of the test regions proved that Faster R-CNN architecture provided the highest accuracy according to the F1 scores, average precision (AP) metrics, and visual inspection of the results. The YOLO-v3 ranked as second, with a slightly lower performance but providing a balanced trade-off between accuracy and speed. The SSD provided the lowest detection performance, but it was better in object localization. The results were also evaluated in terms of the object size and detection accuracy manner, which proved that large- and medium-sized airplanes were detected with higher accuracy.

Highlights

  • Object detection from satellite imagery has considerable importance in areas, such as defense and military applications, urban studies, airport surveillance, vessel traffic monitoring, and transportation infrastructure determination

  • Remote sensing images obtained from satellite sensors are much complex than computer vision images since these images are obtained from high altitudes, including interference from the atmosphere, viewpoint variation, background clutter, and illumination differences [1]

  • The default generated bounding boxes vary with the location and aspect ratio, and a scale process is applied by matching each ground truth box to a default box with the best jaccard overlapping value, which should be higher than 0.5 threshold

Read more

Summary

Introduction

Object detection from satellite imagery has considerable importance in areas, such as defense and military applications, urban studies, airport surveillance, vessel traffic monitoring, and transportation infrastructure determination. You Only Look Once (YOLO) [35] and Single Shot MultiBox Detector (SSD) [36] networks, which convert the classification and localization steps of the object detection task into a regression problem, can perform object detection tasks with a single neural network structure These new methods have overwhelmed the object proposal techniques in major competitions, such as PASCAL VOC (Pattern Analysis, Statistical Modelling and Computational Learning Visual Object Classes) [37] and COCO (Common Objects in Context) [38], where objects are detected from natural images. -- PPrroovviiddiinngg aa ccoommppaarraattiivvee eevvaalluuaattiioonn ooff oobbjjeecctt ddeetteeccttiioonn mmooddeellss aaccrroossss ddiiffffeerreenntt oobbjjeecctt ssiizzeess aanndd ddiiffffeerreenntt IIOOUUss aanndd pprreeffoorrmm aann iinnddeeppeennddeenntt eevvaalluuaattiioonn wwiitthh ffuullll--ssiizzeedd ((llaarrggee--ssccaallee)) PPlleeiiaaddeess ssaatteelllliittee iimmaaggeess tthhaatt hhaavvee ddiiffffeerreenntt rreessoolluuttiioonn ssppeeccss tthhaann tthhee ttrraaiinniinngg ddaattaasseett ttoo iinnvveessttiiggaattee tthhee ttrraannssffeerraabbiilliittyy

Data and Methods
Data and Augmentation
SSD Network Framework
Default Bounding Boxes and Negative Sample Generation
Loss Function
Residual Blocks
Training
Results and Discussion
Evaluation Metrics
Evaluation with COCO API
Conclusions
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call