Multi-Layers Feature Fusion of Convolutional Neural Network for Scene Classification of Remote Sensing

Chenhui Ma,Dexuan Sha,Xiaodong Mu

doi:10.1109/access.2019.2936215

Chenhui Ma, Dexuan Sha + Show 1 more

Open Access

https://doi.org/10.1109/access.2019.2936215

Copy DOI

Journal: IEEE Access	Publication Date: Jan 1, 2019
Citations: 48	License type: CC BY 4.0

Affiliation: George Mason University, Xi'an High Tech University

Abstract

Remote sensing scene classification is still a challenging task in remote sensing applications. How to effectively extract features from a dataset with limited scale is crucial for improvement of scene classification. Recently, convolutional neural network (CNN) performs impressively in different fields of computer vision and has been used for remote sensing. However, most works focus on the feature maps of the last convolution layer and pay little attention to the benefits of additional layers. In fact, the feature information hidden in different layers has potential for feature discrimination capacity. The most attention of this work is how to explore the potential of multiple layers from a CNN model. Therefore, this paper proposes multi-layers feature fusion based on CNN and designs a fusion module to solve relevant issues of fusion. In this module, firstly, all the feature maps are transformed to match sizes mutually due to infeasible fusion of feature maps with different scales; then, two fusion methods are introduced to integrate feature maps from different layers instead of the last convolution layer only; finally, the fusion of features are delivered to the next layer or classifier as the routine CNN does. The experimental results show that the suggested methods achieve promising performance on public datasets.

Highlights

Remote sensing scene classification is one of the most important remote sensing applications [1]
The main contribution of this work lines in: 1) we make full use of the basic operations of convolutional neural network (CNN) including pooling and convolution with 1 × 1 kernel to transform features with different sizes into the shape which can be fused together, and avoid introducing the traditional operators to increase the complexity of model structure; 2) we present a framework with two fusion methods, concatenation process and addition process, to integrate multi-layer features; 3) we discuss multilayer feature fusion applied for fusion in local parts and global model of CNN
Based on Inception-v3, concatenation process can achieve better accuracy for University of California Merced (UCM), while addition process obtains better results for NWPU-RESISC, while it seems difficult to judge which more suitable based on VGG-16 is; this indicates that different data distribution and CNN model require different fusion methods for intralayer fusion