Abstract

Owing to their strong local representation learning ability, deep convolutional neural networks (CNNs) perform well in remote sensing scene classification. However, CNNs focus on location-sensitive representations in the spatial domain and are limited in mining contextual information. Meanwhile, remote sensing scene classification still faces challenges such as complex scenes and large variations in target size. Addressing these problems requires more robust feature representation learning networks. In this paper, a novel and explainable spatial-frequency multi-scale Transformer framework, SF-MSFormer, is proposed for remote sensing scene classification. It mainly comprises spatial-domain and frequency-domain multi-scale Transformer branches, which jointly capture global multi-scale representations in the spatial and frequency domains. In addition, a texture-enhanced encoder is designed for the frequency-domain branch to adaptively capture global texture features, and an adaptive feature aggregation module integrates the spatial-frequency multi-scale features for final recognition. Experimental results verify the effectiveness of SF-MSFormer and show improved convergence. It achieves state-of-the-art overall accuracies of 98.72%, 98.6%, 99.72%, and 94.83% on the AID, UCM, WHU-RS19, and NWPU-RESISC45 datasets, respectively. Feature visualizations further demonstrate the explainability of the texture-enhanced encoder. The code implementation of this article will be available at https://github.com/yutinyang/SF-MSFormer.
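To make the dual-branch structure described above concrete, the following is a minimal PyTorch sketch of the idea: a spatial Transformer branch, a frequency-domain branch whose encoder attends over FFT-transformed tokens, and a gated fusion of the two. All class names (`SFMSFormerSketch`, `TextureEnhancedEncoder`, `AdaptiveAggregation`), the FFT-based texture modeling, and the sigmoid-gated aggregation are illustrative assumptions, not the authors' implementation (see the linked repository for that); the multi-scale pyramid is omitted for brevity.

```python
import torch
import torch.nn as nn


class TextureEnhancedEncoder(nn.Module):
    """Hypothetical frequency-domain encoder: attention over FFT-transformed
    tokens, so the branch sees global texture (spectral) statistics."""

    def __init__(self, dim, num_heads=4):
        super().__init__()
        self.norm = nn.LayerNorm(dim)
        self.attn = nn.MultiheadAttention(dim, num_heads, batch_first=True)

    def forward(self, x):  # x: (B, N, C) token sequence
        # Move tokens to the frequency domain; keeping the real part is an
        # illustrative simplification, not the paper's exact formulation.
        freq = torch.fft.fft(x, dim=1).real
        h = self.norm(freq)
        out, _ = self.attn(h, h, h)
        return out + x  # residual connection back to the spatial tokens


class AdaptiveAggregation(nn.Module):
    """Assumed fusion scheme: a learned soft gate between the two branches."""

    def __init__(self, dim):
        super().__init__()
        self.gate = nn.Sequential(nn.Linear(2 * dim, dim), nn.Sigmoid())

    def forward(self, f_spatial, f_freq):
        g = self.gate(torch.cat([f_spatial, f_freq], dim=-1))
        return g * f_spatial + (1 - g) * f_freq


class SFMSFormerSketch(nn.Module):
    """Single-scale sketch of the abstract's dual-branch architecture."""

    def __init__(self, num_classes=45, dim=256, patch_dim=3 * 16 * 16):
        super().__init__()
        self.embed = nn.Linear(patch_dim, dim)  # naive 16x16 patch embedding
        enc_layer = nn.TransformerEncoderLayer(dim, nhead=4, batch_first=True)
        self.spatial_branch = nn.TransformerEncoder(enc_layer, num_layers=2)
        self.freq_branch = TextureEnhancedEncoder(dim)
        self.fuse = AdaptiveAggregation(dim)
        self.head = nn.Linear(dim, num_classes)

    def forward(self, patches):  # patches: (B, N, patch_dim)
        tokens = self.embed(patches)
        f_sp = self.spatial_branch(tokens).mean(dim=1)  # pool over tokens
        f_fr = self.freq_branch(tokens).mean(dim=1)
        return self.head(self.fuse(f_sp, f_fr))


# Usage: two images pre-split into 196 flattened 16x16 RGB patches.
model = SFMSFormerSketch()
x = torch.randn(2, 196, 3 * 16 * 16)
logits = model(x)  # shape (2, 45), one score per NWPU-RESISC45 class
```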
