Abstract

In this paper, the problem of multi-scale geospatial object detection in High Resolution Remote Sensing Images (HRRSI) is tackled. The different flight heights, shooting angles and sizes of geographic objects in the HRRSI lead to large scale variance in geographic objects. The inappropriate anchor size to propose the objects and the indiscriminative ability of features for describing the objects are the main causes of missing detection and false detection in multi-scale geographic object detection. To address these challenges, we propose a class-specific anchor based and context-guided multi-class object detection method with a convolutional neural network (CNN), which can be divided into two parts: a class-specific anchor based region proposal network (RPN) and a discriminative feature with a context information classification network. A class-specific anchor block providing better initial values for RPN is proposed to generate the anchor of the most suitable scale for each category in order to increase the recall ratio. Meanwhile, we proposed to incorporate the context information into the original convolutional feature to improve the discriminative ability of the features and increase classification accuracy. Considering the quality of samples for classification, the soft filter is proposed to select effective boxes to improve the diversity of the samples for the classifier and avoid missing or false detection to some extent. We also introduced the focal loss in order to improve the classifier in classifying the hard samples. The proposed method is tested on a benchmark dataset of ten classes to prove the superiority. The proposed method outperforms some state-of-the-art methods with a mean average precision (mAP) of 90.4% and better detects the multi-scale objects, especially when objects show a minor shape change.

Highlights

  • The emergence of High Resolution Remote Sensing Images (HRRSI) poses new challenges and requirements for the interpretation and recognition of remote sensing images

  • The CACMOD convolutional neural network (CNN) performs better in those categories, while Faster R-CNN is with some missing targets and false alarms

  • The size in specific-class anchors is more suitable than that in Faster Regions with CNN features (RCNN) especially for the small objects since they can provide the initial anchors covering the scales of all the categories

Read more

Summary

Introduction

The emergence of HRRSI poses new challenges and requirements for the interpretation and recognition of remote sensing images. Many methods were studied for object detection of HRRSI. The similarity measure is utilized to find the best matches of templates generated manually or from the labeled instances in the template matching-based object detection. These methods are sensitive to shape and perspective changes. Knowledge-based object detection methods represent the geometric and context information such as shape, geometry, spatial relationship and other features for object extraction in the form of rules to determine the objects satisfying the rules. Machine learning-based object detection methods mainly involve feature extraction, dimension reduction, classification and other processes [16]. The CNN-based object detection algorithms are studied in this paper

Methods
Results
Discussion
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call