Abstract

Different-resolution change detection (DRCD) has become an urgent problem, with great potential in rapid monitoring applications such as disaster assessment and urban expansion. In DRCD tasks, the bi-temporal inputs have different resolutions, so conventional change detection (CD) methods cannot be applied directly. Previous studies have attempted to address this problem by reconstructing the low-resolution (LR) image into a high-resolution (HR) one, for example through interpolation or super-resolution (SR). However, these solutions are limited by the availability of training data, which makes it difficult to meet diverse application needs. Moreover, such image-level strategies ignore the interaction and alignment of high-level features. Therefore, we propose a new approach based on multi-model Transformers (MM-Trans), which addresses the resolution gap between bi-temporal inputs in DRCD tasks from the perspective of feature alignment. In MM-Trans, a weight-unshared feature extractor is first used to precisely capture the features of the different-resolution inputs; a spatial-aligned Transformer (sp-Trans) then aligns the LR-image features to the size of the HR-image features, optimized in a learnable way by an auxiliary token loss; after that, a semantic-aligned Transformer (se-Trans) allows the bi-temporal features to further interact and be aligned semantically; finally, a prediction head produces fine-grained change results. Experiments conducted on three common CD datasets, CDD, S2Looking, and HTCD, show the effectiveness of MM-Trans and demonstrate its potential in DRCD tasks.
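
To make the four-stage pipeline described above concrete, the following is a minimal sketch in PyTorch of how the stages could be wired together. All module designs, dimensions, and the LR-to-HR scale factor are assumptions for illustration; the actual sp-Trans, se-Trans, auxiliary token loss, and extractor architectures are defined in the paper, not here.

```python
import torch
import torch.nn as nn

class MMTransSketch(nn.Module):
    """Hypothetical sketch of the MM-Trans pipeline from the abstract.
    Module choices and sizes are illustrative assumptions only."""

    def __init__(self, dim=64, ratio=4):
        super().__init__()
        # Weight-unshared feature extractors for the HR and LR inputs
        self.hr_encoder = nn.Conv2d(3, dim, 3, padding=1)
        self.lr_encoder = nn.Conv2d(3, dim, 3, padding=1)
        # Stand-in for the spatial-aligned Transformer (sp-Trans):
        # approximated here by learnable upsampling of the LR features
        self.sp_trans = nn.Sequential(
            nn.Upsample(scale_factor=ratio, mode="bilinear", align_corners=False),
            nn.Conv2d(dim, dim, 3, padding=1),
        )
        # Stand-in for the semantic-aligned Transformer (se-Trans):
        # a standard Transformer encoder over flattened bi-temporal tokens
        layer = nn.TransformerEncoderLayer(d_model=dim, nhead=4, batch_first=True)
        self.se_trans = nn.TransformerEncoder(layer, num_layers=2)
        # Prediction head producing a 2-class (change / no-change) map
        self.head = nn.Conv2d(2 * dim, 2, 1)

    def forward(self, hr_img, lr_img):
        f_hr = self.hr_encoder(hr_img)                  # (B, C, H, W)
        f_lr = self.sp_trans(self.lr_encoder(lr_img))   # spatially aligned to (B, C, H, W)
        b, c, h, w = f_hr.shape
        # Flatten both temporal features into token sequences and let them interact
        tokens = torch.cat([f_hr, f_lr], dim=0).flatten(2).transpose(1, 2)  # (2B, HW, C)
        tokens = self.se_trans(tokens)
        f_hr2, f_lr2 = tokens.transpose(1, 2).reshape(2, b, c, h, w)
        # Fuse the aligned bi-temporal features and predict the change map
        return self.head(torch.cat([f_hr2, f_lr2], dim=1))  # (B, 2, H, W)
```

Under these assumptions, a forward pass would take an HR image of shape (B, 3, H, W) and an LR image of shape (B, 3, H/4, W/4) and return a full-resolution two-class change map; the auxiliary token loss mentioned in the abstract would supervise the sp-Trans stage during training.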
