Abstract
In recent years, transformer-based approaches have become a hot research topic in hyperspectral image (HSI) classification. However, most of these studies focus on optimizing the model framework in pursuit of high classification accuracy and pay little attention to the composition of the input token sequence, an important factor affecting transformer performance. This paper therefore explores a novel token structure to strengthen the transformer's performance on HSI classification, and on this basis develops a Multi-Range Spectral-Spatial Transformer (MRSST) framework. Specifically, a two-branch convolutional feature pre-encoder is designed to extract shallow features for each spectral channel separately. A token generator then combines these shallow features with the raw spectral information to yield token sequences carrying multi-range information. Finally, the tokens are fed into transformer encoders enhanced by a module that strengthens the information exchange between their mid-range and short-range semantic features. Experiments on three well-known hyperspectral datasets demonstrate that the proposed multi-range composite token sequences and information exchange mechanism significantly enhance the transformer's performance. Code is released at: https://github.com/HyperSystemAndImageProc/Multi-Range-Spectral-Spatial-Transformer.
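To make the idea of a multi-range token concrete, the following is a minimal illustrative sketch, not the MRSST implementation. It assumes one token per spectral band and replaces the learned two-branch convolutional pre-encoder with fixed box-mean filters. The function names (`box_mean_at_center`, `build_tokens`), the window sizes (3 and 7), and the patch size are all hypothetical choices made for illustration.

```python
import numpy as np

def box_mean_at_center(patch, k):
    """Per-band mean over the k x k window centred on the patch.

    Stand-in (assumption) for one learned convolutional branch.
    """
    h, w, _ = patch.shape
    ci, cj = h // 2, w // 2
    r = k // 2
    window = patch[ci - r:ci + r + 1, cj - r:cj + r + 1, :]
    return window.mean(axis=(0, 1))

def build_tokens(patch, short_k=3, mid_k=7):
    """Build one token per spectral band from an (H, W, C) patch.

    Each token concatenates the raw centre-pixel value with a
    short-range and a mid-range neighbourhood feature (hypothetical layout).
    """
    h, w, _ = patch.shape
    raw = patch[h // 2, w // 2, :]                 # raw spectral information
    short = box_mean_at_center(patch, short_k)     # short-range feature
    mid = box_mean_at_center(patch, mid_k)         # mid-range feature
    return np.stack([raw, short, mid], axis=1)     # shape (C, 3)

# Synthetic 9 x 9 patch with 30 bands (random data, not a real dataset).
rng = np.random.default_rng(0)
patch = rng.random((9, 9, 30))
tokens = build_tokens(patch)
```

Here `tokens` is a sequence of 30 band-wise tokens, each holding three values drawn from different spatial ranges. In the framework described above, the features would be learned and the resulting sequence would be passed to the transformer encoders.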