Abstract

Recently, convolutional neural networks (CNNs) have dominated the ground-based cloud image segmentation task, but they disregard long-range dependencies owing to the limited size of their filters. Although Transformer-based methods can overcome this limitation, they learn long-range dependencies only at a single scale and hence fail to capture the multi-scale information of cloud images. Multi-scale information benefits ground-based cloud image segmentation, because features at small scales tend to capture detailed information while features at large scales can learn global information. In this paper, we propose a novel deep network named Integration Transformer (InTransformer), which builds long-range dependencies at different scales. To this end, we propose the Hybrid Multi-head Transformer Block (HMTB) to learn multi-scale long-range dependencies, and hybridize CNN and HMTB as the encoder at different scales. The resulting encoder extracts multi-scale representations, learning both local information and long-range dependencies at different scales. Meanwhile, to fuse patch tokens from different scales, we propose the Mutual Cross-Attention Module (MCAM) for the decoder of InTransformer, which lets multi-scale patch tokens interact adequately in a bidirectional way. We have conducted a series of experiments on the large ground-based cloud detection database TLCDD and on SWIMSEG. The experimental results show that our method outperforms other methods, proving the effectiveness of the proposed InTransformer.
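The abstract's bidirectional fusion of multi-scale patch tokens can be illustrated with plain cross-attention run in both directions: fine-scale tokens query coarse-scale tokens, and vice versa. The sketch below is a minimal, single-head numpy illustration of that idea, not the paper's actual MCAM; the token counts, dimension, and shared projection matrices are illustrative assumptions.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax over the given axis.
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def cross_attention(queries_from, keys_values_from, Wq, Wk, Wv):
    """One direction of cross-attention: tokens in `queries_from`
    attend to (and aggregate information from) `keys_values_from`."""
    Q = queries_from @ Wq
    K = keys_values_from @ Wk
    V = keys_values_from @ Wv
    scores = softmax(Q @ K.T / np.sqrt(Q.shape[-1]))
    return scores @ V

rng = np.random.default_rng(0)
d = 8
fine = rng.standard_normal((16, d))    # hypothetical small-scale (detailed) patch tokens
coarse = rng.standard_normal((4, d))   # hypothetical large-scale (global) patch tokens
Wq, Wk, Wv = (rng.standard_normal((d, d)) for _ in range(3))

# Bidirectional ("mutual") exchange: each scale queries the other,
# so both token sets are updated with information from the other scale.
fine_updated = cross_attention(fine, coarse, Wq, Wk, Wv)
coarse_updated = cross_attention(coarse, fine, Wq, Wk, Wv)
print(fine_updated.shape, coarse_updated.shape)  # (16, 8) (4, 8)
```

Each updated token set keeps its own length and dimension, so the fused features can be passed on to subsequent decoder stages unchanged in shape.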
