Abstract
Hyperspectral images (HSIs) contain nearly continuous spectral information, so a target of interest can be accurately identified from subtle details of its spectral properties. Spectral resolution at different scales captures different levels of spectral features: small-scale spectral bands are beneficial for extracting global details in vision transformers, while large-scale spectral bands are more effective for local features. The transformer shows advantages in global information extraction through its self-attention module and even surpasses CNNs in various tasks, and several works based on the vision transformer have performed surprisingly well in HSI classification. However, a single-scale vision transformer cannot balance the extraction of local details against the redundancy present at different scales. A recent multi-scale vision transformer has addressed this for image classification with spatial patch-wise features. Inspired by this, we propose the cross-spectral vision transformer (CSiT), which uses two branches to extract pixel-wise multi-scale features, and further design a multi-scale spectral embedding module to enhance local details between neighboring spectral bands. Moreover, a single token from each branch serves as a query in a cross-attention operation to exchange information with the other branch. We evaluate the classification performance of the proposed CSiT on three classic HSI datasets through extensive experiments, showing that the multi-scale vision transformer architecture yields promising results for HSI classification with one-dimensional spectral bands.
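The cross-attention exchange between branches can be sketched roughly as follows: the classification token of one branch acts as a query over the patch tokens of the other branch. This is a minimal single-head illustration with hypothetical shapes, omitting the learned query/key/value projections and multi-head structure used in the actual model:

```python
import numpy as np

def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def cross_attention(cls_token, other_tokens, d):
    """Scaled dot-product attention: one branch's class token (1, d)
    attends over the other branch's patch tokens (n, d)."""
    scores = cls_token @ other_tokens.T / np.sqrt(d)  # (1, n) attention logits
    weights = softmax(scores, axis=-1)                # (1, n) attention weights
    return weights @ other_tokens                     # (1, d) fused token

rng = np.random.default_rng(0)
d = 8  # hypothetical embedding dimension
cls_small = rng.standard_normal((1, d))     # class token of the small-scale branch
tokens_large = rng.standard_normal((5, d))  # patch tokens of the large-scale branch
fused = cross_attention(cls_small, tokens_large, d)
print(fused.shape)  # (1, 8)
```

The fused token then carries information from the other branch's scale back into its own branch, which is what allows the two spectral scales to complement each other.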