Abstract

Different from object detection in natural image, optical remote sensing object detection is a challenging task, due to the diverse meteorological conditions, complex background, varied orientations, scale variations, etc. In this paper, to address this issue, we propose a novel object detection network (the global-local saliency constraint network, GLS-Net) that can make full use of the global semantic information and achieve more accurate oriented bounding boxes. More precisely, to improve the quality of the region proposals and bounding boxes, we first propose a saliency pyramid which combines a saliency algorithm with a feature pyramid network, to reduce the impact of complex background. Based on the saliency pyramid, we then propose a global attention module branch to enhance the semantic connection between the target and the global scenario. A fast feature fusion strategy is also used to combine the local object information based on the saliency pyramid with the global semantic information optimized by the attention mechanism. Finally, we use an angle-sensitive intersection over union (IoU) method to obtain a more accurate five-parameter representation of the oriented bounding boxes. Experiments with a publicly available object detection dataset for aerial images demonstrate that the proposed GLS-Net achieves a state-of-the-art detection performance.

Highlights

  • Simultaneous localization and category recognition are the fundamental but challenging tasks of aerial image object detection

  • We describe the DOTA dataset, the implementation details, and the ablation studies conducted with the proposed method

  • The dataset contains a total of 15 categories, including plane, baseball diamond (BD), bridge, ground track field (GTF), small vehicle (SV), large vehicle (LV), ship, tennis court (TC), basketball court (BC), storage tank (ST), soccer-ball field (SBF), roundabout (RA), harbor, swimming pool (SP), and helicopter (HC)

Read more

Summary

Introduction

Simultaneous localization and category recognition are the fundamental but challenging tasks of aerial image object detection. Object detection in aerial images has become one of the hot topics in the computer vision field, and is used in a wide range of applications, such as traffic control, airport surveillance, monitoring of oil storage facilities, inshore ship detection, and military target discovery [1,2,3,4]. The difficulties of aerial image object detection are mainly due to the varied weather conditions and the variation of the orientation and scale of the objects. Here we define the objects in this article as the categories that are selected by experts in aerial image interpretation, according to whether a kind of objects is common and its value for real-world applications [11]

Methods
Results
Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.