Abstract

ABSTRACT Traditional methods focus on low-level handcrafted features representations and it is difficult to design a comprehensive classification algorithm for remote sensing scene classification problems. Recently, convolutional neural networks (CNNs) have obtained remarkable performance outcomes, setting several remote sensing benchmarks. Furthermore, direct applications of UAV remote sensing images that use deep convolutional networks are extremely challenging given high input data dimensionality with relatively small amounts of available labelled data. We, therefore, propose a CNN approach to scene classification that architecturally incorporates sparse coding (SC) technique for dimension reduction to minimize overfitting. Outcomes were compared with principal component analysis (PCA) and global average pooling (GAP) alternatives that use fully connected layer(s) in pre-trained CNN architecture(s) to minimize overfitting. SC was used to encode deep features extracted from the last convolutional layer of pre-trained CNN models by using different features maps in which deep features had been converted into low-dimensional SC features. These same sparse-coded features were concatenated by means of different pooling techniques to obtain global image features for scene classification. The proposed algorithm outperformed current state-of-the-art algorithms based on handcrafted features. When using our own UAV-based dataset and existing datasets, it was also exceptionally efficient computationally when learning data representations, producing a 93.64% accuracy rate..

Highlights

  • Unmanned aerial vehicle (UAV) imagery is used for environment considerations and the monitoring of various resources

  • sparse coding (SC) was used to encode deep features extracted from the last convolutional layer of pre-trained convolutional neural networks (CNNs) models by using different features maps in which deep features had been converted into low-dimensional SC features

  • This is in addition to our comparison of results to those obtained by global average pooling (GAP) and principal component analysis (PCA) techniques that replaced fully connected layers with pre-trained CNN architecture(s)

Read more

Summary

Introduction

UAV imagery is used for environment considerations and the monitoring of various resources. These remotely acquired data hold spatial and spectral properties that can be analyzed and applied for modeling (Jaakkola et al, 2010). UAV vectors operate at low altitudes to obtain high-resolution images that cover relatively small areas (Gómez-Candón et al, 2014). The deep CNN models capture the data in hierarchical way and these models are based on sequential modules, where output of previous module is the input of module. The operations accomplished in a convolutional layer can be summarized as

Objectives
Methods
Findings
Discussion
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.