Abstract

Feature extraction in remote sensing is a challenging yet crucial step in scene classification because of cloud cover and overlapping edges in the imagery. Many architectures have been used solely as feature-extraction backbones for complex computer vision tasks such as object detection and semantic segmentation. Although the remote sensing literature has compared deep learning models for scene classification, a systematic comparison between transformer-based and convolution-based architectures has been missing. This work therefore comprehensively analyses different deep learning architectures on multiple scene classification datasets to understand the learned features and weigh the advantages of the functional connections used in different convolutional neural networks. Five open-source benchmark datasets are used: UC Merced Land Use, WHU-RS19, Optimal-31, RSI-CB256, and MLRSNet. Feature extraction for remote sensing natural scene classification is performed with ImageNet-22k pre-trained weights of convolution-based architectures (VGG-16, ResNet50, EfficientNetB3, and ConvNeXt) and transformer-based architectures (Vision Transformer (ViT) and Swin Transformer). The networks are then fine-tuned with the LinBnDrop block from the fastai framework before scene classification through a softmax layer. Our work establishes a new benchmark for all datasets on a 90:10 train-test split. Guidance on choosing an architecture based on the available data and the target application is also provided. The analysis of the 42 experiments conducted in this work will help the research community understand these scene classification datasets and gain better insight into fine-tuning.
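
As a concrete illustration of the fine-tuning setup described above, the sketch below builds a classification head with fastai's LinBnDrop block on top of features from a pre-trained backbone. This is a minimal sketch, not the authors' code: the feature dimension, hidden width, and dropout rates are illustrative assumptions, and 21 classes corresponds to the UC Merced Land Use dataset.

    # Minimal sketch of a LinBnDrop fine-tuning head (fastai) applied to
    # backbone features; the sizes below are assumptions, not paper values.
    import torch
    import torch.nn as nn
    from fastai.layers import LinBnDrop

    num_features = 768  # assumed backbone embedding size (e.g., ViT-Base)
    num_classes = 21    # UC Merced Land Use has 21 scene classes

    # LinBnDrop chains BatchNorm1d -> Dropout -> Linear (+ optional activation)
    head = nn.Sequential(
        LinBnDrop(num_features, 512, p=0.25, act=nn.ReLU()),  # hidden layer
        LinBnDrop(512, num_classes, p=0.5),                   # class logits
        nn.Softmax(dim=1),                                    # class probabilities
    )

    features = torch.randn(8, num_features)  # stand-in for backbone output
    probs = head(features)                   # shape (8, 21); rows sum to 1

In practice the softmax would usually be folded into a cross-entropy loss during training; it is kept explicit here only to mirror the softmax classification layer named in the abstract.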
