Abstract

In the hyperspectral image (HSI) classification, there are numerous deep learning-based research routes that have emerged recently. Among them, two methodologies attract our attention. One is CNN-based classification and the other is transformer-based classification. The essence of these two methodologies is to interchange information locally or at a long distance for HSI pixels in the spatial or spectral-spatial domain. There are two principles underlying this essence—the information mixing mechanism and the information mixing domain. Although both CNN-based and transformer-based have made efforts in these two principles and obtained favorable classification results, there is still room for improvement in terms of accuracy and efficiency. To further enhance the accuracy and efficiency under the two principles, fast Fourier transform (FFT) is introduced to HSI classification and a fast Fourier filter is designed to mix information efficiently in the frequency domain by means of FFT. The parametric-free characteristic and fast computation of FFT can assist us in efficiently learning interactions among features in the frequency domain. Furthermore, a fast Fourier filter block is built upon the fast Fourier filter for repeatedly using as a basic block. In addition, we propose a spectral-spatial convolution tokenizer (SSCT) to extract shallow features and prepare spectral-spatial tokens for fast Fourier filter blocks. Finally, by employing SSCT and fast Fourier filter blocks, a novel deep neural network architecture—fast Fourier filter network (F <sup xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">3</sup> Net) is proposed for HSI classification. F <sup xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">3</sup> Net-P as a pyramidal variant of F <sup xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">3</sup> Net is also investigated. Experimental results on four datasets comprehensively evaluate our models and indicate that they are competitive with several current state-of-the-art methods, especially when the training samples are extremely limited. Specifically, F <sup xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">3</sup> Net-P achieves the highest accuracy of 97.25%, 98.08%, 97.49% and 97.95% on the four datasets, respectively, outperforming second best compared model by 1.49%, 2.03%, 2.14% and 1.94%.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call