Transformer-based multimodal change detection with multitask consistency constraints

Biyuan Liu,Huaixin Chen,Kun Li,Michael Ying Yang

doi:10.1016/j.inffus.2024.102358

Biyuan Liu, Huaixin Chen + Show 2 more

Open Access

https://doi.org/10.1016/j.inffus.2024.102358

Copy DOI

Export

Save

Cite

Journal: Information Fusion	Publication Date: Mar 24, 2024
Citations: 4	License type: cc-by-nc-nd

Abstract
Full-Text
Similar Papers

Abstract

Listen

Change detection plays a fundamental role in Earth observation for analyzing temporal iterations over time. However, recent studies have largely neglected the utilization of multimodal data that presents significant practical and technical advantages compared to single-modal approaches. This research focuses on leveraging pre-event digital surface model (DSM) data and post-event digital aerial images captured at different times for detecting change beyond 2D. We observe that the current change detection methods struggle with the multitask conflicts between semantic and height change detection tasks. To address this challenge, we propose an efficient Transformer-based network that learns shared representation between cross-dimensional inputs through cross-attention. It adopts a consistency constraint to establish the multimodal relationship. Initially, pseudo-changes are derived by employing height change thresholding. Subsequently, the L2 distance between semantic and pseudo-changes within their overlapping regions is minimized. This explicitly endows the height change detection (regression task) and semantic change detection (classification task) with representation consistency. A DSM-to-image multimodal dataset encompassing three cities in the Netherlands was constructed. It lays a new foundation for beyond-2D change detection from cross-dimensional inputs. Compared to five state-of-the-art change detection methods, our model demonstrates consistent multitask superiority in terms of semantic and height change detection. Furthermore, the consistency strategy can be seamlessly adapted to the other methods, yielding promising improvements.

Full Text

Published Version

View

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

Transformer-based multimodal change detection with multitask consistency constraints

Abstract

Published Version

Talk to us

Similar Papers

More From: Information Fusion

Lead the way for us

Similar Papers

Semantic feature-constrained multitask siamese network for building change detection in high-spatial-resolution remote sensing imagery
Qian Shen ... Xin Zhang
ISPRS Journal of Photogrammetry and Remote Sensing | VOL. 189
Qian Shen, et. al.Qian Shen ... Xin Zhang
12 May 2022
ISPRS Journal of Photogrammetry and Remote Sensing | VOL. 189

Spatial-Temporal Semantic Perception Network for Remote Sensing Image Semantic Change Detection
You He ... Xiaogang Ning
Remote Sensing | VOL. 15
You He, et. al.You He ... Xiaogang Ning
20 Aug 2023
Remote Sensing | VOL. 15

Weakly Supervised Silhouette-based Semantic Scene Change Detection
Ken Sakurada ... Weimin Wang
-
Ken Sakurada, et. al.Ken Sakurada ... Weimin Wang
01 May 2020
01 May 2020

Large-scale deep learning based binary and semantic change detection in ultra high resolution remote sensing imagery: From benchmark datasets to urban application
Shiqi Tian ... Liangpei Zhang
ISPRS Journal of Photogrammetry and Remote Sensing | VOL. 193
Shiqi Tian, et. al.Shiqi Tian ... Liangpei Zhang
24 Sep 2022
ISPRS Journal of Photogrammetry and Remote Sensing | VOL. 193

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

Transformer-based multimodal change detection with multitask consistency constraints

Abstract

Published Version

Talk to us

Similar Papers

More From: Information Fusion