F3Net: Fast Fourier Filter Network for Hyperspectral Image Classification

Hao Shi,Di Yang,Youqiang Zhang,Yanbo Liu,Guo Cao,Zixian Ge

doi:10.1109/tim.2023.3277100

Abstract

In the hyperspectral image (HSI) classification, there are numerous deep learning-based research routes that have emerged recently. Among them, two methodologies attract our attention. One is CNN-based classification and the other is transformer-based classification. The essence of these two methodologies is to interchange information locally or at a long distance for HSI pixels in the spatial or spectral-spatial domain. There are two principles underlying this essence—the information mixing mechanism and the information mixing domain. Although both CNN-based and transformer-based have made efforts in these two principles and obtained favorable classification results, there is still room for improvement in terms of accuracy and efficiency. To further enhance the accuracy and efficiency under the two principles, fast Fourier transform (FFT) is introduced to HSI classification and a fast Fourier filter is designed to mix information efficiently in the frequency domain by means of FFT. The parametric-free characteristic and fast computation of FFT can assist us in efficiently learning interactions among features in the frequency domain. Furthermore, a fast Fourier filter block is built upon the fast Fourier filter for repeatedly using as a basic block. In addition, we propose a spectral-spatial convolution tokenizer (SSCT) to extract shallow features and prepare spectral-spatial tokens for fast Fourier filter blocks. Finally, by employing SSCT and fast Fourier filter blocks, a novel deep neural network architecture—fast Fourier filter network (F 3 Net) is proposed for HSI classification. F 3 Net-P as a pyramidal variant of F 3 Net is also investigated. Experimental results on four datasets comprehensively evaluate our models and indicate that they are competitive with several current state-of-the-art methods, especially when the training samples are extremely limited. Specifically, F 3 Net-P achieves the highest accuracy of 97.25%, 98.08%, 97.49% and 97.95% on the four datasets, respectively, outperforming second best compared model by 1.49%, 2.03%, 2.14% and 1.94%.

Full Text