Abstract

Remote sensing image scene classification, which consists of labeling remote sensing images with a set of categories based on their content, has received remarkable attention for many applications such as land use mapping. Standard approaches are based on the multi-layer representation of first-order convolutional neural network (CNN) features. However, second-order CNNs have recently been shown to outperform traditional first-order CNNs on many computer vision tasks. Hence, the aim of this paper is to demonstrate the use of second-order statistics of CNN features for remote sensing scene classification. These statistics take the form of covariance matrices computed locally or globally on the output of a CNN. Such matrices do not lie in a Euclidean space but on a Riemannian manifold, so Euclidean tools are not suited to manipulate them. Other metrics should be considered, such as the log-Euclidean one, which consists of projecting the set of covariance matrices onto a tangent space defined at a reference point. In this tangent plane, which is a vector space, conventional machine learning algorithms can be applied, such as Fisher vector encoding or the SVM classifier. Based on this log-Euclidean framework, we propose a novel transfer learning approach composed of two hybrid architectures based on covariance pooling of CNN features: the first is local and the second is global. Both rely on features extracted from models pre-trained on the ImageNet dataset, which are then processed with machine learning algorithms. The first hybrid architecture is an ensemble learning approach using the log-Euclidean Fisher vector encoding of region covariance matrices computed locally on the first layers of a CNN. The second is an ensemble learning approach based on the covariance pooling of CNN features extracted globally from the deepest layers. These two ensemble learning approaches are then combined according to the strategy of the most diverse ensembles. For validation and comparison purposes, the proposed approach is tested on several challenging remote sensing datasets. Experimental results exhibit a significant gain of approximately 2% in overall accuracy for the proposed approach compared to a similar state-of-the-art method based on covariance pooling of CNN features on the UC Merced dataset.
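To make the second-order pipeline concrete, the sketch below illustrates global covariance pooling of a CNN feature map followed by the log-Euclidean mapping (matrix logarithm, then vectorization of the tangent-space matrix). It is a minimal NumPy illustration: the feature-map shape, the regularization constant and the function names are assumptions made for the example, not details taken from the paper.

```python
import numpy as np

def covariance_pooling(features, eps=1e-5):
    """Covariance pooling: `features` has shape (n, d), e.g. an activation
    map reshaped to (H*W, C). Returns a d x d symmetric positive-definite
    (SPD) matrix, regularized so the matrix logarithm is well defined."""
    centered = features - features.mean(axis=0, keepdims=True)
    cov = centered.T @ centered / (features.shape[0] - 1)
    return cov + eps * np.eye(cov.shape[0])

def log_euclidean_map(cov):
    """Log-Euclidean mapping: project the SPD matrix onto the tangent space
    at the identity via the matrix logarithm, then keep its upper triangle
    as a Euclidean vector usable by FV encoding or an SVM classifier."""
    eigval, eigvec = np.linalg.eigh(cov)
    log_cov = eigvec @ np.diag(np.log(eigval)) @ eigvec.T
    rows, cols = np.triu_indices(log_cov.shape[0])
    return log_cov[rows, cols]

# Hypothetical 7x7x512 deep feature map taken from a pre-trained CNN.
feature_map = np.random.rand(7 * 7, 512)
descriptor = log_euclidean_map(covariance_pooling(feature_map))
```

The resulting descriptor lives in a vector space, so it can be fed directly to conventional machine learning tools, in line with the log-Euclidean framework described above.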

Highlights

  • The aim of a supervised classification algorithm is to label an image with the corresponding class according to its content

  • In order to benefit from pre-trained deep neural networks and second-order representations, this work proposes a novel ensemble learning approach based on covariance pooling of convolutional neural network (CNN) features for remote sensing scene classification

  • We introduced in [44] a hybrid approach named ELCP, an ensemble learning approach based on covariance pooling of CNN features

Summary

Introduction

The aim of a supervised classification algorithm is to label an image with the corresponding class according to its content. Conventional approaches are based on encoding handcrafted features with, for example, the bag of words (BoW) model [1], the vector of locally aggregated descriptors (VLAD) [2,3] or Fisher vectors (FV) [4,5,6]. These strategies have proved successful in a wide range of applications such as image classification [4,7,8], text retrieval [9], action and face recognition [10], etc. A CNN is built from various hidden layers performing different kinds of transformations, such as convolutions, pooling, and activation functions.
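As an illustration of the transfer-learning setting discussed in the abstract, the hedged sketch below extracts both shallow and deep convolutional features from a model pre-trained on ImageNet. The VGG-16 backbone, the layer indices and the file name are assumptions chosen for the example, not choices stated in the text.

```python
import torch
from torchvision import models, transforms
from PIL import Image

# A CNN pre-trained on ImageNet; VGG-16 is an assumed backbone here.
cnn = models.vgg16(weights=models.VGG16_Weights.IMAGENET1K_V1).eval()

preprocess = transforms.Compose([
    transforms.Resize((224, 224)),
    transforms.ToTensor(),
    transforms.Normalize(mean=[0.485, 0.456, 0.406],
                         std=[0.229, 0.224, 0.225]),
])

image = preprocess(Image.open("scene.jpg").convert("RGB")).unsqueeze(0)

with torch.no_grad():
    # Early convolutional blocks: local features, candidate inputs for
    # region covariance matrices computed on the first layers.
    shallow = cnn.features[:10](image)   # (1, 128, 56, 56)
    # Full convolutional stack: deepest features, candidate input for
    # global covariance pooling.
    deep = cnn.features(image)           # (1, 512, 7, 7)
```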
