Abstract

Remote sensing scene classification is still a challenging task in remote sensing applications. How to effectively extract features from a dataset with limited scale is crucial for improvement of scene classification. Recently, convolutional neural network (CNN) performs impressively in different fields of computer vision and has been used for remote sensing. However, most works focus on the feature maps of the last convolution layer and pay little attention to the benefits of additional layers. In fact, the feature information hidden in different layers has potential for feature discrimination capacity. The most attention of this work is how to explore the potential of multiple layers from a CNN model. Therefore, this paper proposes multi-layers feature fusion based on CNN and designs a fusion module to solve relevant issues of fusion. In this module, firstly, all the feature maps are transformed to match sizes mutually due to infeasible fusion of feature maps with different scales; then, two fusion methods are introduced to integrate feature maps from different layers instead of the last convolution layer only; finally, the fusion of features are delivered to the next layer or classifier as the routine CNN does. The experimental results show that the suggested methods achieve promising performance on public datasets.

Highlights

  • Remote sensing scene classification is one of the most important remote sensing applications [1]

  • The main contribution of this work lines in: 1) we make full use of the basic operations of convolutional neural network (CNN) including pooling and convolution with 1 × 1 kernel to transform features with different sizes into the shape which can be fused together, and avoid introducing the traditional operators to increase the complexity of model structure; 2) we present a framework with two fusion methods, concatenation process and addition process, to integrate multi-layer features; 3) we discuss multilayer feature fusion applied for fusion in local parts and global model of CNN

  • Based on Inception-v3, concatenation process can achieve better accuracy for University of California Merced (UCM), while addition process obtains better results for NWPU-RESISC, while it seems difficult to judge which more suitable based on VGG-16 is; this indicates that different data distribution and CNN model require different fusion methods for intralayer fusion

Read more

Summary

Introduction

Remote sensing scene classification is one of the most important remote sensing applications [1].

Results
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.