Self-supervised learning for remote sensing scene classification under the few shot scenario

Najd Alosaimi,Naif Alajlan,Yakoub Bazi,Haikel Alhichri,Belgacem Ben Youssef

doi:10.1038/s41598-022-27313-5

Abstract

Scene classification is a crucial research problem in remote sensing (RS) that has attracted many researchers recently. It has many challenges due to multiple issues, such as: the complexity of remote sensing scenes, the classes overlapping (as a scene may contain objects that belong to foreign classes), and the difficulty of gaining sufficient labeled scenes. Deep learning (DL) solutions and in particular convolutional neural networks (CNN) are now state-of-the-art solution in RS scene classification; however, CNN models need huge amounts of annotated data, which can be costly and time-consuming. On the other hand, it is relatively easy to acquire large amounts of unlabeled images. Recently, Self-Supervised Learning (SSL) is proposed as a method that can learn from unlabeled images, potentially reducing the need for labeling. In this work, we propose a deep SSL method, called RS-FewShotSSL, for RS scene classification under the few shot scenario when we only have a few (less than 20) labeled scenes per class. Under this scenario, typical DL solutions that fine-tune CNN models, pre-trained on the ImageNet dataset, fail dramatically. In the SSL paradigm, a DL model is pre-trained from scratch during the pretext task using the large amounts of unlabeled scenes. Then, during the main or the so-called downstream task, the model is fine-tuned on the labeled scenes. Our proposed RS-FewShotSSL solution is composed of an online network and a target network both using the EfficientNet-B3 CNN model as a feature encoder backbone. During the pretext task, RS-FewShotSSL learns discriminative features from the unlabeled images using cross-view contrastive learning. Different views are generated from each image using geometric transformations and passed to the online and target networks. Then, the whole model is optimized by minimizing the cross-view distance between the online and target networks. To address the problem of limited computation resources available to us, our proposed method uses a novel DL architecture that can be trained using both high-resolution and low-resolution images. During the pretext task, RS-FewShotSSL is trained using low-resolution images, thereby, allowing for larger batch sizes which significantly boosts the performance of the proposed pipeline on the task of RS classification. In the downstream task, the target network is discarded, and the online network is fine-tuned using the few labeled shots or scenes. Here, we use smaller batches of both high-resolution and low-resolution images. This architecture allows RS-FewshotSSL to benefit from both large batch sizes and full image sizes, thereby learning from the large amounts of unlabeled data in an effective way. We tested RS-FewShotSSL on three RS public datasets, and it demonstrated a significant improvement compared to other state-of-the-art methods such as: SimCLR, MoCo, BYOL and IDSSL.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Scientific Reports	Publication Date: Jan 9, 2023
Citations: 2	License type: open-access

R Discovery Prime

R Discovery Prime

Self-supervised learning for remote sensing scene classification under the few shot scenario

Abstract

Talk to us

Similar Papers

More From: Scientific Reports

Lead the way for us

Similar Papers

A Generic Self-Supervised Learning (SSL) Framework for Representation Learning from Spectral–Spatial Features of Unlabeled Remote Sensing Imagery
Xin Zhang ... Liangxiu Han
Remote Sensing | VOL. 15
Xin Zhang, et. al.Xin Zhang ... Liangxiu Han
03 Nov 2023
Remote Sensing | VOL. 15

RS-DeepSuperLearner: fusion of CNN ensemble for remote sensing scene classification
Haikel Alhichri
Annals of GIS | VOL. 29
Haikel AlhichriHaikel Alhichri
02 Jan 2023
Annals of GIS | VOL. 29

When Self-Supervised Learning Meets Scene Classification: Remote Sensing Scene Classification Based on a Multitask Learning Framework
Zhicheng Zhao ... Ze Luo
Remote Sensing | VOL. 12
Zhicheng Zhao, et. al.Zhicheng Zhao ... Ze Luo
09 Oct 2020
Remote Sensing | VOL. 12

Learning Deep Cross-Modal Embedding Networks for Zero-Shot Remote Sensing Image Scene Classification
Yansheng Li ... Zhihui Zhu
IEEE Transactions on Geoscience and Remote Sensing | VOL. 59
Yansheng Li, et. al.Yansheng Li ... Zhihui Zhu
14 Jan 2021
IEEE Transactions on Geoscience and Remote Sensing | VOL. 59

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Self-supervised learning for remote sensing scene classification under the few shot scenario

Abstract

Talk to us

Similar Papers

More From: Scientific Reports