Abstract

With an ever-increasing resolution of optical remote-sensing images, how to extract information from these images efficiently and effectively has gradually become a challenging problem. As it is prohibitively expensive to label every object in these high-resolution images manually, there is only a small number of high-resolution images with detailed object labels available, highly insufficient for common machine learning-based object detection algorithms. Another challenge is the huge range of object sizes: it is difficult to locate large objects, such as buildings and small objects, such as vehicles, simultaneously. To tackle these problems, we propose a novel neural network based remote sensing object detector called full-coverage collaborative network (FCC-Net). The detector employs various tailored designs, such as hybrid dilated convolutions and multi-level pooling, to enhance multiscale feature extraction and improve its robustness in dealing with objects of different sizes. Moreover, by utilizing asynchronous iterative training alternating between strongly supervised and weakly supervised detectors, the proposed method only requires image-level ground truth labels for training. To evaluate the approach, we compare it against a few state-of-the-art techniques on two large-scale remote-sensing image benchmark sets. The experimental results show that FCC-Net significantly outperforms other weakly supervised methods in detection accuracy. Through a comprehensive ablation study, we also demonstrate the efficacy of the proposed dilated convolutions and multi-level pooling in increasing the scale invariance of an object detector.

Highlights

  • Remote sensing is an interdisciplinary subject involving technologies [1] such as aviation, optical instrument, electronic sensor and computer science, etc

  • (1) We propose a novel end-to-end remote sensing object detection network (FCC-Net) combining a weakly supervised stronglysupervised supervised detector for addressing the challenge a weakly superviseddetector detector and and aa strongly detector for addressing the challenge of of insufficient labeled remote sensing data, which improves the performance of using insufficient labeled remote sensing data, which improves the performance of only usingonly imageimage-level labels training significant; level labels training significant; design a scale robust moduleon onthe thetop top of of the backbone dilated convolutions

  • Our experiments demonstrate that the proposed full-coverage residual network (FCRN) can increase the final receptive field to cover the entire area and avoid voids or loss of edge information, which directly improves the robustness of the detector to multiscale objects Electronics

Read more

Summary

Introduction

Remote sensing is an interdisciplinary subject involving technologies [1] such as aviation, optical instrument, electronic sensor and computer science, etc. One of the main drawbacks of these conventional techniques is their computational efficiency; they must examine a large number of sliding windows one-by-one Another problem is that, as object boundaries are often not well-defined in remote-sensing images, the result of feature extraction algorithms, such as scale-invariant features (SIFT) [10], direction gradient histogram (HOG) [11], etc., are unreliable for object detection. We propose a cascade, low-complexity multiscale feature fusion module to strengthen the backbone against scale variations caused by large difference of objects sizes in the remote-sensing images, which is proved to be effective in challenging RSOD tasks.

Related
Small Objects in Remote Sensing Images
Insufficient Training Examples of Remote Sensing Images
Foreground-Background Class Imbalance
Proposed Method
Objects
Cascade Multi-Level Pooling Module
Collaborative Detection SubNetwork for Weakly Supervised RSOD
2: Collaborative
Datasets
Precision–Recall Curve
Average Precision and Mean Average Precision
Correct Location
Implementation Details
Evaluation on TGRS-HRRSD Dataset
Methods
Evaluation on DIOR Dataset
Ablation Experiments
Qualitative Results
6.6.Conclusions
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call