Active Deep Learning for Classification of Hyperspectral Images
Active deep learning classification of hyperspectral images is considered in this paper. Deep learning has achieved success in many applications, but good-quality labeled samples are needed to construct a deep learning network. It is expensive getting good labeled samples in hyperspectral images for remote sensing applications. An active learning algorithm based on a weighted incremental dictionary learning is proposed for such applications. The proposed algorithm selects training samples that maximize two selection criteria, namely representative and uncertainty. This algorithm trains a deep network efficiently by actively selecting training samples at each iteration. The proposed algorithm is applied for the classification of hyperspectral images, and compared with other classification algorithms employing active learning. It is shown that the proposed algorithm is efficient and effective in classifying hyperspectral images.
- Research Article
16
- 10.1109/jstars.2023.3237566
- Jan 1, 2023
- IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
In recent years, deep neural networks have been widely used for hyperspectral image (HSI) classification and have shown excellent performance using numerous labeled samples. The acquisition of HSI labels is usually based on the field investigation, which is expensive and time consuming. Hence, the available labels are usually limited, which affects the efficiency of deep HSI classification methods. To improve the classification performance while reducing the labeling cost, this article proposes a semisupervised deep learning (DL) method for HSI classification, named pyramidal dilation attention convolutional network with active and self-paced learning (PDAC-ASPL), which integrates active learning (AL), self-paced learning (SPL), and DL into a unified framework. First, a densely connected pyramidal dilation attention convolutional network is trained with a limited number of labeled samples. Then, the most informative samples from the unlabeled set are selected by AL and queried real labels, and the highest confidence samples with corresponding pseudo labels are extracted by SPL. Finally, the samples from AL and SPL are added to the training set to retrain the network. Compared with some DL- and AL-based HSI classification methods, our PDAC-ASPL achieves better performance on four HSI datasets.
- Research Article
19
- 10.3390/rs15030752
- Jan 28, 2023
- Remote Sensing
Hyperspectral image (HSI) classification has recently been successfully explored by using deep learning (DL) methods. However, DL models rely heavily on a large number of labeled samples, which are laborious to obtain. Therefore, finding a way to efficiently embed DL models in limited labeled samples is a hot topic in the field of HSI classification. In this paper, an active learning-based siamese network (ALSN) is proposed to solve the limited labeled samples problem in HSI classification. First, we designed a dual learning-based siamese network (DLSN), which consists of a contrastive learning module and a classification module. Secondly, in view of the problem that active learning is difficult to effectively sample under the extremely limited labeling cost, we proposed an adversarial uncertainty-based active learning (AUAL) method to query valuable samples, and to promote DLSN to learn a more complete feature distribution by fine-tuning. Finally, an active learning architecture, based on inter-class uncertainty (ICUAL), is proposed to construct a lightweight sample pair training set, fully extracting the inter-class information of sample pairs and improving classification accuracy. Experiments on three generic HSI datasets strongly demonstrated the effectiveness of ALSN for HSI classification, with performance improvements over other related DL methods.
- Research Article
359
- 10.1109/tgrs.2020.2964627
- Jul 1, 2020
- IEEE Transactions on Geoscience and Remote Sensing
Deep neural network has been extensively applied to hyperspectral image (HSI) classification recently. However, its success is greatly attributed to numerous labeled samples, whose acquisition costs a large amount of time and money. In order to improve the classification performance while reducing the labeling cost, this article presents an active deep learning approach for HSI classification, which integrates both active learning and deep learning into a unified framework. First, we train a convolutional neural network (CNN) with a limited number of labeled pixels. Next, we actively select the most informative pixels from the candidate pool for labeling. Then, the CNN is fine-tuned with the new training set constructed by incorporating the newly labeled pixels. This step together with the previous step is iteratively conducted. Finally, Markov random field (MRF) is utilized to enforce class label smoothness to further boost the classification performance. Compared with the other state-of-the-art traditional and deep learning-based HSI classification methods, our proposed approach achieves better performance on three benchmark HSI data sets with significantly fewer labeled samples.
- Research Article
171
- 10.1109/tgrs.2018.2868851
- Mar 1, 2019
- IEEE Transactions on Geoscience and Remote Sensing
Deep learning has recently attracted significant attention in the field of hyperspectral images (HSIs) classification. However, the construction of an efficient deep neural network (DNN) mostly relies on a large number of labeled samples being available. To address this problem, this paper proposes a unified deep network, combined with active transfer learning that can be well-trained for HSIs classification using only minimally labeled training data. More specifically, deep joint spectral-spatial feature is first extracted through hierarchical stacked sparse autoencoder (SSAE) networks. Active transfer learning is then exploited to transfer the pre-trained SSAE network and the limited training samples from the source domain to the target domain, where the SSAE network is subsequently fine-tuned using the limited labeled samples selected from both source and target domain by corresponding active learning strategies. The advantages of our proposed method are threefold: 1) the network can be effectively trained using only limited labeled samples with the help of novel active learning strategies; 2) the network is flexible and scalable enough to function across various transfer situations, including cross-dataset and intra-image; 3) the learned deep joint spectral-spatial feature representation is more generic and robust than many joint spectral-spatial feature representation. Extensive comparative evaluations demonstrate that our proposed method significantly outperforms many state-of-the-art approaches, including both traditional and deep network-based methods, on three popular datasets.
- Discussion
11
- 10.3390/s20174975
- Sep 2, 2020
- Sensors (Basel, Switzerland)
Relying on large scale labeled datasets, deep learning has achieved good performance in image classification tasks. In agricultural and biological engineering, image annotation is time-consuming and expensive. It also requires annotators to have technical skills in specific areas. Obtaining the ground truth is difficult because natural images are expensive. In addition, images in these areas are usually stored as multichannel images, such as computed tomography (CT) images, magnetic resonance images (MRI), and hyperspectral images (HSI). In this paper, we present a framework using active learning and deep learning for multichannel image classification. We use three active learning algorithms, including least confidence, margin sampling, and entropy, as the selection criteria. Based on this framework, we further introduce an “image pool” to make full advantage of images generated by data augmentation. To prove the availability of the proposed framework, we present a case study on agricultural hyperspectral image classification. The results show that the proposed framework achieves better performance compared with the deep learning model. Manual annotation of all the training sets achieves an encouraging accuracy. In comparison, using active learning algorithm of entropy and image pool achieves a similar accuracy with only part of the whole training set manually annotated. In practical application, the proposed framework can remarkably reduce labeling effort during the model development and upadting processes, and can be applied to multichannel image classification in agricultural and biological engineering.
- Conference Article
19
- 10.1109/icip.2017.8296306
- Sep 1, 2017
Hyperspectral image(HSI) Classification is one of the most prevalent issue in remote sensing area. Recently, application of deep learning in HSI classification has emerged. However, merging spatial features with spectral properties in deep learning is a pervasive problem. This paper presents, a discriminative spatial updated deep belief network (SDBN) which effectively utilizes spatial information within spectrally identical contiguous pixels for HSI classification. In the proposed approach, HSI is first segmented into adaptive boundary adjustment based spatially similar regions with similar spectral features, following which an object-level feature extraction and classification is undertaken using deep belief network (DBN) based decision fusion approach that incorporate spatial-segmented contextual and spectral information into a DBN framework for effective spectral-spatial HSI classification. Moreover, for improved accuracy, band preference/correlation based feature selection approach is used to select the most informative bands without compromising the original content in HSI. Usage of local contextual features and spectral similarity from adaptive boundary adjustment based approach, and integration of spatial and spectral features into DBN results into improved accuracy of the final HSI classification. Experimental results on well known hyperspectral data indicates the classification accuracy of the proposed method over several existing techniques.
- Research Article
34
- 10.3390/rs12233879
- Nov 26, 2020
- Remote Sensing
Deep learning classifiers exhibit remarkable performance for hyperspectral image classification given sufficient labeled samples but show deficiency in the situation of learning with limited labeled samples. Active learning endows deep learning classifiers with the ability to alleviate this deficiency. However, existing active deep learning methods tend to underestimate the feature variability of hyperspectral images when querying informative unlabeled samples subject to certain acquisition heuristics. A major reason for this bias is that the acquisition heuristics are normally derived based on the output of a deep learning classifier, in which representational power is bounded by the number of labeled training samples at hand. To address this limitation, we developed a feature-oriented adversarial active learning (FAAL) strategy, which exploits the high-level features from one intermediate layer of a deep learning classifier for establishing an acquisition heuristic based on a generative adversarial network (GAN). Specifically, we developed a feature generator for generating fake high-level features and a feature discriminator for discriminating between the real high-level features and the fake ones. Trained with both the real and the fake high-level features, the feature discriminator comprehensively captures the feature variability of hyperspectral images and yields a powerful and generalized discriminative capability. We leverage the well-trained feature discriminator as the acquisition heuristic to measure the informativeness of unlabeled samples. Experimental results validate the effectiveness of both (i) the full FAAL framework and (ii) the adversarially learned acquisition heuristic, for the task of classifying hyperspectral images with limited labeled samples.
- Research Article
8
- 10.3390/rs16060990
- Mar 12, 2024
- Remote Sensing
In hyperspectral image (HSI) classification scenarios, deep learning-based methods have achieved excellent classification performance, but often rely on large-scale training datasets to ensure accuracy. However, in practical applications, the acquisition of hyperspectral labeled samples is time consuming, labor intensive and costly, which leads to a scarcity of obtained labeled samples. Suffering from insufficient training samples, few-shot sample conditions limit model training and ultimately affect HSI classification performance. To solve the above issues, an active learning (AL)-based multipath residual involution Siamese network for few-shot HSI classification (AL-MRIS) is proposed. First, an AL-based Siamese network framework is constructed. The Siamese network, which has relatively low demand for sample data, is adopted for classification, and the AL strategy is integrated to select more representative samples to improve the model’s discriminative ability and reduce the costs of labeling samples in practice. Then, the multipath residual involution (MRIN) module is designed for the Siamese subnetwork to obtain the comprehensive features of the HSI. The involution operation was used to capture the fine-grained features and effectively aggregate the contextual semantic information of the HSI through dynamic weights. The MRIN module comprehensively considers the local features, dynamic features and global features through multipath residual connections, which improves the representation ability of HSIs. Moreover, a cosine distance-based contrastive loss is proposed for the Siamese network. By utilizing the directional similarity of high-dimensional HSI data, the discriminability of the Siamese classification network is improved. A large number of experimental results show that the proposed AL-MRIS method can achieve excellent classification performance with few-shot training samples, and compared with several state-of-the-art classification methods, the AL-MRIS method obtains the highest classification accuracy.
- Research Article
357
- 10.1016/j.neucom.2021.03.035
- Mar 23, 2021
- Neurocomputing
With the rapid development of deep learning technology and improvement in computing capability, deep learning has been widely used in the field of hyperspectral image (HSI) classification. In general, deep learning models often contain many trainable parameters and require a massive number of labeled samples to achieve optimal performance. However, in regard to HSI classification, a large number of labeled samples is generally difficult to acquire due to the difficulty and time-consuming nature of manual labeling. Therefore, many research works focus on building a deep learning model for HSI classification with few labeled samples. In this article, we concentrate on this topic and provide a systematic review of the relevant literature. Specifically, the contributions of this paper are twofold. First, the research progress of related methods is categorized according to the learning paradigm, including transfer learning, active learning and few-shot learning. Second, a number of experiments with various state-of-the-art approaches has been carried out, and the results are summarized to reveal the potential research directions. More importantly, it is notable that although there is a vast gap between deep learning models (that usually need sufficient labeled samples) and the HSI scenario with few labeled samples, the issues of small-sample sets can be well characterized by fusion of deep learning methods and related techniques, such as transfer learning and a lightweight model. For reproducibility, the source codes of the methods assessed in the paper can be found at https://github.com/ShuGuoJ/HSI-Classification.git.
- Research Article
28
- 10.1109/access.2019.2963624
- Jan 1, 2020
- IEEE Access
In Hyperspectral image (HSI) classification, combining spectral information with spatial information has become an efficient measure to obtain good classification results, where spatial information is generally introduced in an unsupervised way or some complicated way. We introduce spatial coordinates as the spatial information in a simple supervised way and propose two HSI classification algorithms, where spatial coordinates of samples are regarded as the spatial features of samples. A spectral-spatial classification algorithm is proposed, named as HSI Classification Based on Spectral-Spatial Feature Fusion using Spatial Coordinates (SSFFSC). The HSI is divided into multiple small images in spatial dimension, and samples in each small image are randomly selected as training samples. Support vector machine (SVM) is used to classify the samples to obtain the probability of samples belonging to each class according to the spatial coordinate features and spectral features respectively. The probability features are further classified by SVM to achieve the final classification result. Considering that the performance of SSFFSC relies on the partition of HSI, SSFFSC is further combined with active learning (AL) as a new method named as HSI Classification Based on Active Learning and SSFFSC (SSFFSC-AL). Partition of HSI is omitted and the training samples are selected adaptively by AL’s sampling scheme. We find spatial coordinates are useful spatial information. SSFFSC and SSFFSC-AL run fast and improve the classification accuracy effectively by using the spatial coordinates as the spatial features. Experiments demonstrate that comparing with other algorithms, SSFFSC and SSFFSC-AL can obtain higher classification accuracy in less time.
- Research Article
66
- 10.1109/tgrs.2022.3149947
- Jan 1, 2022
- IEEE Transactions on Geoscience and Remote Sensing
In the field of hyperspectral image (HSI) classification, deep learning has helped achieve great successes. However, most of these achievements are made with very large amounts of labeled training data. Manual annotation of HSIs is labor intensive and time consuming. In practical HSI classification, there may only be a few labeled samples available. To perform HSI classification with a small number of labeled samples, a new few-shot classification model based on adaptive subspaces and featurewise transformation is proposed in this article. First, we design a 3-D local channel attention residual network to obtain the spatial–spectral features of HSIs. Then, a featurewise transformation strategy is introduced to enhance feature diversity to avoid model overfitting problems and to mitigate the impact of cross-domain problems. Finally, a subspace classifier is implemented to construct different subspace categories based on the embedded features of the limited labeled samples. Classification of an HSI sample is performed using spatial projection and a distance metric. The proposed model is trained using the metalearning mechanism to perform few-shot classification of HSIs. Four public datasets are utilized to construct a sufficient few-shot classification task named episodes for training. The other three public datasets are used to test the proposed model. Experiments show that our proposed method can outperform mainstream small sample HSI classification methods.
- Research Article
23
- 10.3390/rs14030596
- Jan 26, 2022
- Remote Sensing
Deep neural networks (DNNs) have promoted much of the recent progress in hyperspectral image (HSI) classification, which depends on extensive labeled samples and deep network structure and has achieved surprisingly good generalization capacity. However, due to the expensive labeling cost, the labeled samples are scarce in most practice cases, which causes these DNN-based methods to be prone to over-fitting and influences the classification result. To mitigate this problem, we present a clustering-inspired active learning method for enhancing the HSI classification result, which mainly contributes to two aspects. On one hand, the modified clustering by fast search and find of peaks clustering method is utilized to select highly informative and diverse samples from unlabeled samples in the candidate set for manual labeling, which empowers us to appropriately augment the limited training set (i.e., labeled samples) and thus improves the generalization capacity of the baseline DNN model. On the other hand, another K-means clustering-based pseudo-labeling scheme is utilized to pre-train the DNN model with all samples in the candidate set. By doing this, the pre-trained model can be effectively generalized to unlabeled samples in the testing set after being fine tuned-based on the augmented training set. The experiment accuracies on two benchmark HSI datasets show the effectiveness of the proposed method.
- Research Article
53
- 10.3390/ijgi7020065
- Feb 20, 2018
- ISPRS International Journal of Geo-Information
This paper investigates recent research on active learning for (geo) text and image classification, with an emphasis on methods that combine visual analytics and/or deep learning. Deep learning has attracted substantial attention across many domains of science and practice, because it can find intricate patterns in big data; but successful application of the methods requires a big set of labeled data. Active learning, which has the potential to address the data labeling challenge, has already had success in geospatial applications such as trajectory classification from movement data and (geo) text and image classification. This review is intended to be particularly relevant for extension of these methods to GISience, to support work in domains such as geographic information retrieval from text and image repositories, interpretation of spatial language, and related geo-semantics challenges. Specifically, to provide a structure for leveraging recent advances, we group the relevant work into five categories: active learning, visual analytics, active learning with visual analytics, active deep learning, plus GIScience and Remote Sensing (RS) using active learning and active deep learning. Each category is exemplified by recent influential work. Based on this framing and our systematic review of key research, we then discuss some of the main challenges of integrating active learning with visual analytics and deep learning, and point out research opportunities from technical and application perspectives—for application-based opportunities, with emphasis on those that address big data with geospatial components.
- Research Article
2
- 10.1117/1.jrs.15.016512
- Feb 26, 2021
- Journal of Applied Remote Sensing
Hyperspectral image (HSI) classification is a challenging problem due to the high dimensional features, high intra-class variance, and limited prior information, and the classification is the basis for HSI applications. Active learning (AL) and semisupervised learning (SSL) are two promising approaches in the HSI classification. In AL, the traditional entropy query-by-bagging (EQB) algorithm only pays attention on uncertainty and ignore the diversity among the samples. Therefore, we propose averaged normalized entropy query-by-bagging (anEQB) algorithm. Meanwhile, the collaborative active learning and semisupervised learning framework (CASSL) may invoke many wrong pseudolabels and deteriorate the classification performance. To make up for the deficiency of CASSL, we complement different AL algorithms to constitute a multiple filtering mode semisupervised learning framework (MFMSLF). To further study, we introduce syncretic secondary filtering mode into multiple verification semisupervised framework and thus constitute a multiple secondary filtering mode semisupervised verification framework (MSFMSVF). We evaluate the performance of anEQB, MFMSLF, and MSFMSVF on different hyperspectral data sets and compare them with other state-of-the-art HSI classification methods. Numerical experimental results reveal the superior classification performance of anEQB, MFMSLF, and MSFMSVF, respectively. Experimental results also demonstrate that exploring the information and diversity of the samples from different criterion can improve the classification performance of the collaborative framework.
- Research Article
71
- 10.3390/rs11242974
- Dec 11, 2019
- Remote Sensing
Random forest (RF) has obtained great success in hyperspectral image (HSI) classification. However, RF cannot leverage its full potential in the case of limited labeled samples. To address this issue, we propose a unified framework that embeds active learning (AL) and semi-supervised learning (SSL) into RF (ASSRF). Our aim is to utilize AL and SSL simultaneously to improve the performance of RF. The objective of the proposed method is to use a small number of manually labeled samples to train classifiers with relative high classification accuracy. To achieve this goal, a new query function is designed to query the most informative samples for manual labeling, and a new pseudolabeling strategy is introduced to select some samples for pseudolabeling. Compared with other AL- and SSL-based methods, the proposed method has several advantages. First, ASSRF utilizes the spatial information to construct a query function for AL, which can select more informative samples. Second, in addition to providing more labeled samples for SSL, the proposed pseudolabeling method avoids bias caused by AL-labeled samples. Finally, the proposed model retains the advantages of RF. To demonstrate the effectiveness of ASSRF, we conducted experiments on three real hyperspectral data sets. The experimental results have shown that our proposed method outperforms other state-of-the-art methods.