Abstract

Most existing deep learning-based methods use feature maps extracted from convolutional neural networks (CNNs) for classification and detection of high-resolution remote sensing images (HRSIs). However, directly applying these features to classification and object detection in HRSI is problematic because of rotational variations. In this article, we design networks using the polycentric circle pooling (PCP) strategy to alleviate the abovementioned problem. The PCP network (PCP-net) structure can generate a fixed-length representation for different input image sizes and encode rotation-invariant information. With these advantages, PCP-net should in general improve the CNN-based HRSI classification methods. Specifically, on the basis of the concentric circle pooling network structure, we improve the structure using multiple concentric circle centers to generate more robust rotation-invariant information. Using two challenging HRSI scene datasets, we prove that PCP-net improves the accuracy of CNN architectures for a scene classification tasks. PCP-net can be conveniently applied to object detection because the output size is fixed regardless of image size. Experiments applying the faster region-CNN to a publicly available ten-class object detection dataset demonstrate that our proposed PCP can achieve accuracy higher than that of a region of interest pooling in the HRSI object detection task.

Highlights

  • R ECENT advances in a wide range of computer vision applications have been driven by convolutional neural networks (CNNs) [1] and the availability of large-scale training data [2]

  • We describe in detail our polycentric circle pooling (PCP) network (PCPnet) architecture for high-resolution remote sensing images (HRSIs) recognition

  • This article presents a simple but effective method for handling the rotation variance problem in CNNs for HRSI classification. This issue is important in HRSI recognition in the context of a deep network

Read more

Summary

INTRODUCTION

R ECENT advances in a wide range of computer vision applications have been driven by convolutional neural networks (CNNs) [1] and the availability of large-scale training data [2]. CNNs can certainly alleviate the effect of local transformation by convolution and pooling operations, but lack the ability to handle large image rotation [12] This drawback limits the performance of CNNs in HRSI classification tasks. We propose a polycentric circle pooling (PCP) method to learn a rotation-invariant CNN (RICNN) based model for HRSI classification. We implement a new CCP method with multiple circle centers to obtain more robust rotation-invariant information for the CNN-based HRSI classification and detection tasks. 1) Given the deformation of objects in HRSIs, we fuse CCP with multiple circle centers to capture more robust spatial rotation-invariant information. 2) This work applies CCP to the faster region-based convolutional neural network (R-CNN) [7] for geospatial object detection.

RELATED WORK
PCP Layer
Training the Network
PCP-NET FOR SCENE CLASSIFICATION
Experimental Datasets
Parameter Optimization
Dataset and Experimental Setup
Detection Results
Running Time
Findings
CONCLUSION
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call