Abstract

The aerial scene classification is one of the major tasks in the remote sensing community that automatically labels the corresponding semantic categories of aerial images. Recently, a lot of methods based on deep neural networks have been proposed, in which hierarchical internal feature are extracted for representations. However, these presented methods often have complex structures and require large volume of memory, and a large number of labeled aerial scene images are difficult to obtain, hindering their implementation in practical applications. In this paper, we present the between-class similarity priori and adaptive knowledge mimic (BPKM) method for aerial scene classification. First, the method extracts the efficient prior relationship information of the scene images from large-scale network. Then, a compressed network is generated through learning the output and the intermediate representations of the large-scale network, and the compressed network achieves better feature description ability; in addition, an improved cross-entropy method with an adaptive threshold is applied to reduce the training time consumption. A large-scale data set (AID) and UC-Merced data set are considered for performance evaluation, and the experimental results indicate that the proposed method is about $24\times $ parameters saving compared to popular networks, e.g., AlexNet, and has outstanding classification performance in classes with similar features.

Highlights

  • Scene classification has been a significant topic in the remote sensing field

  • (3) The third relies on deep learning (DL) to adaptively learn hierarchical features, so as to establish the mapping relationships from raw pixels to high-level semantics, such as AlexNet [6], ResNet [7], and so on

  • In order to solve the problem that convergence on aerial image training is time-consuming, we propose an improved crossentropy method based on adaptive threshold control which can decide whether to stop training according to the value of cross-entropy threshold

Read more

Summary

Introduction

Scene classification has been a significant topic in the remote sensing field. The category of each scene depends on the involved object and determined by each semantic region and its hierarchical structure or spatial layout, which is different from the generic object classification task. (3) The third relies on deep learning (DL) to adaptively learn hierarchical features, so as to establish the mapping relationships from raw pixels to high-level semantics, such as AlexNet [6], ResNet [7], and so on. Structural information of remote sensing images, for example, Scale Invariant Feature Transform (SIFT) [1], Local Binary Pattern (LBP) [2], and so on. Among these methods, the DL-related studies are of great concern, which outperformed most of the other conventional methods on visual recognition.

Objectives
Methods
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call