Abstract
Semantic segmentation is essential in the field of remote sensing, where it underpins applications such as environmental monitoring and land cover classification. Recent advancements aim to jointly classify data from diverse sensors and epochs to improve predictive accuracy. With the availability of vast Satellite Image Time Series (SITS) data, supervised deep learning methods, such as Transformer models, have become viable options. This paper introduces the Temporal Vision Transformer (ViT), designed to extract features from SITS. These features, capturing the temporal patterns of land cover classes, are integrated with features derived from aerial imagery to improve land cover classification. Drawing inspiration from the success of Transformers in Natural Language Processing (NLP), the Temporal ViT concurrently extracts spatial and temporal information from SITS data using tailored positional encoding strategies. The proposed approach fosters comprehensive feature learning across both domains, facilitating seamless integration of the encoded SITS data into aerial images. Furthermore, a training strategy is proposed that encourages the Temporal ViT to focus on classes whose appearance changes over the year. Extensive experiments carried out in this work demonstrate the improved classification performance of the Temporal ViT compared to existing state-of-the-art techniques for multi-modal land cover classification. Our model achieves a 3.8% increase in mean IoU over a network relying solely on aerial images.
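To make the core idea concrete, the following is a minimal sketch of a spatio-temporal Transformer encoder for SITS with separate spatial and temporal positional encodings, whose pooled features are fused with aerial-image features. It is an illustrative assumption, not the authors' implementation: all module names, tensor dimensions, the mean-pooling over time, and the fusion by channel concatenation are placeholders chosen for clarity.

```python
# Illustrative sketch (assumed design, not the paper's exact architecture).
import torch
import torch.nn as nn


class TemporalSITSEncoder(nn.Module):
    def __init__(self, in_channels=10, patch_dim=64, n_patches=16, n_times=12,
                 n_heads=4, n_layers=2):
        super().__init__()
        # Linear embedding of each SITS patch (spectral bands per patch).
        self.embed = nn.Linear(in_channels, patch_dim)
        # Separate learnable positional encodings for space and time.
        self.spatial_pos = nn.Parameter(torch.zeros(1, 1, n_patches, patch_dim))
        self.temporal_pos = nn.Parameter(torch.zeros(1, n_times, 1, patch_dim))
        layer = nn.TransformerEncoderLayer(d_model=patch_dim, nhead=n_heads,
                                           batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers=n_layers)

    def forward(self, sits):
        # sits: (batch, n_times, n_patches, in_channels)
        b, t, p, _ = sits.shape
        # Add spatial and temporal encodings (broadcast over the other axis).
        x = self.embed(sits) + self.spatial_pos + self.temporal_pos
        x = x.reshape(b, t * p, -1)            # joint spatio-temporal token sequence
        x = self.encoder(x)
        # Pool over time so each spatial patch keeps one temporal feature vector.
        return x.reshape(b, t, p, -1).mean(dim=1)   # (batch, n_patches, patch_dim)


def fuse_with_aerial(temporal_feats, aerial_feats):
    # Simple late fusion by channel concatenation (one plausible integration scheme).
    return torch.cat([temporal_feats, aerial_feats], dim=-1)


# Example usage with random tensors standing in for SITS and aerial features.
sits = torch.randn(2, 12, 16, 10)       # batch of 12-step time series, 16 patches
aerial = torch.randn(2, 16, 64)         # per-patch aerial-image features
fused = fuse_with_aerial(TemporalSITSEncoder()(sits), aerial)  # (2, 16, 128)
```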