Abstract

Multisource remote sensing data classification is a challenging research topic: addressing the inherent heterogeneity between multimodal data while exploiting their complementarity is crucial. Existing deep learning models usually adopt feature-level fusion designs directly, most of which, however, fail to overcome the impact of heterogeneity and thus deliver limited performance. To this end, a multimodal joint classification framework, called the global clue-guided cross-memory quaternion transformer network (GCCQTNet), is proposed for the classification of multisource data, i.e., hyperspectral image (HSI) and synthetic aperture radar (SAR)/light detection and ranging (LiDAR) data. First, a three-branch structure is built to extract local and global features, where an independent squeeze-expansion-like fusion (ISEF) structure is designed to update the local and global representations by treating the global information as an agent, suppressing the negative impact of multimodal heterogeneity layer by layer. A cross-memory quaternion transformer (CMQT) structure is then constructed to model the complex inner relationships between intramodality and intermodality features, capturing more discriminative fused features that fully characterize multimodal complementarity. Finally, a cross-modality comparative learning (CMCL) structure is developed to impose a consistency constraint on global information learning, which, together with a classification head, guides the end-to-end training of GCCQTNet. Extensive experiments on three public multisource remote sensing datasets demonstrate the superiority of GCCQTNet over other state-of-the-art methods.
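
To make the high-level pipeline concrete, the PyTorch-style sketch below illustrates the general pattern the abstract describes: two local modality branches (HSI and SAR/LiDAR), a pooled global clue used as an agent to fuse them, and an auxiliary consistency term between the per-modality summaries. All module names, dimensions, and the use of standard cross-attention in place of the cross-memory quaternion transformer are hypothetical placeholders; the abstract does not specify these details, so this is an illustrative sketch rather than the authors' implementation.

    # Illustrative sketch only: layer sizes and module designs are assumptions,
    # not the published GCCQTNet architecture.
    import torch
    import torch.nn as nn
    import torch.nn.functional as F

    class GlobalGuidedFusionSketch(nn.Module):
        """Three-branch skeleton: local HSI features, local SAR/LiDAR features,
        and a pooled global clue acting as an agent between the two branches."""
        def __init__(self, hsi_bands=144, sar_bands=4, dim=64, n_classes=15):
            super().__init__()
            self.hsi_local = nn.Sequential(nn.Conv2d(hsi_bands, dim, 3, padding=1), nn.ReLU())
            self.sar_local = nn.Sequential(nn.Conv2d(sar_bands, dim, 3, padding=1), nn.ReLU())
            # Global branch: one summary token per sample from both modalities.
            self.global_proj = nn.Linear(2 * dim, dim)
            # Standard cross-attention stands in for the CMQT fusion stage.
            self.cross_attn = nn.MultiheadAttention(dim, num_heads=4, batch_first=True)
            self.head = nn.Linear(dim, n_classes)

        def forward(self, hsi_patch, sar_patch):
            h = self.hsi_local(hsi_patch).flatten(2).transpose(1, 2)  # (B, HW, dim)
            s = self.sar_local(sar_patch).flatten(2).transpose(1, 2)  # (B, HW, dim)
            # Global clue: pooled summary of both modalities, used as the query agent.
            g = self.global_proj(torch.cat([h.mean(1), s.mean(1)], dim=-1)).unsqueeze(1)
            tokens = torch.cat([h, s], dim=1)
            fused, _ = self.cross_attn(g, tokens, tokens)             # (B, 1, dim)
            logits = self.head(fused.squeeze(1))
            # Contrastive-style consistency between modality summaries, echoing
            # the role of the CMCL constraint described in the abstract.
            consistency = 1.0 - F.cosine_similarity(h.mean(1), s.mean(1)).mean()
            return logits, consistency

    # Example usage with random patches (sizes are placeholders):
    model = GlobalGuidedFusionSketch()
    logits, consistency = model(torch.randn(2, 144, 9, 9), torch.randn(2, 4, 9, 9))
    loss = F.cross_entropy(logits, torch.tensor([0, 1])) + 0.1 * consistency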
