Contextual Learning in Fourier Complex Field for VHR Remote Sensing Images.

Yan Zhang,Xinbo Gao,Xiyuan Gao,Xiao Pu,Jiaxu Leng,Qingyan Duan

doi:10.1109/tnnls.2023.3319363

Abstract

Very high-resolution (VHR) remote sensing (RS) image classification is the fundamental task for RS image analysis and understanding. Recently, Transformer-based models demonstrated outstanding potential for learning high-order contextual relationships from natural images with general resolution ( ≈ 224 × 224 pixels) and achieved remarkable results on general image classification tasks. However, the complexity of the naive Transformer grows quadratically with the increase in image size, which prevents Transformer-based models from VHR RS image ( ≥ 500 × 500 pixels) classification and other computationally expensive downstream tasks. To this end, we propose to decompose the expensive self-attention (SA) into real and imaginary parts via discrete Fourier transform (DFT) and, therefore, propose an efficient complex SA (CSA) mechanism. Benefiting from the conjugated symmetric property of DFT, CSA is capable to model the high-order contextual information with less than half computations of naive SA. To overcome the gradient explosion in Fourier complex field, we replace the Softmax function with the carefully designed Logmax function to normalize the attention map of CSA and stabilize the gradient propagation. By stacking various layers of CSA blocks, we propose the Fourier complex Transformer (FCT) model to learn global contextual information from VHR aerial images following the hierarchical manners. Universal experiments conducted on commonly used RS classification datasets demonstrate the effectiveness and efficiency of FCT, especially on VHR RS images. The source code of FCT will be available at https://github.com/Gao-xiyuan/FCT.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Contextual Learning in Fourier Complex Field for VHR Remote Sensing Images.

Abstract

Talk to us

Similar Papers

More From: IEEE transactions on neural networks and learning systems

Lead the way for us

Journal: IEEE transactions on neural networks and learning systems	Publication Date: Jan 1, 2024
Citations: 2

Similar Papers

Multi-Modality and Multi-Scale Attention Fusion Network for Land Cover Classification from VHR Remote Sensing Images
Tao Lei ... Zhiyong Lv
Remote Sensing | VOL. 13
Tao Lei, et. al.Tao Lei ... Zhiyong Lv
20 Sep 2021
Remote Sensing | VOL. 13

Method Based on Edge Constraint and Fast Marching for Road Centerline Extraction from Very High-Resolution Remote Sensing Images
Lipeng Gao ... Zhiyong Lv
Remote Sensing | VOL. 10
Lipeng Gao, et. al.Lipeng Gao ... Zhiyong Lv
07 Jun 2018
Remote Sensing | VOL. 10

A Generic FCN-Based Approach for the Road-Network Extraction From VHR Remote Sensing Images – Using OpenStreetMap as Benchmarks
Deng Pan ... Meng Zhang
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing | VOL. 14
Deng Pan, et. al.Deng Pan ... Meng Zhang
01 Jan 2020
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing | VOL. 14

Hierarchical spatial features learning with deep CNNs for very high-resolution remote sensing image classification
Guangyun Zhang ... Xiuping Jia
International Journal of Remote Sensing | VOL. 39
Guangyun Zhang, et. al.Guangyun Zhang ... Xiuping Jia
22 Aug 2018
International Journal of Remote Sensing | VOL. 39

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Contextual Learning in Fourier Complex Field for VHR Remote Sensing Images.

Abstract

Talk to us

Similar Papers

More From: IEEE transactions on neural networks and learning systems