Asymmetric Network Combining CNN and Transformer for Building Extraction from Remote Sensing Images.

Junhao Chang, Gang Cen, Yuefeng Cen

doi:10.3390/s24196198

Open Access

https://doi.org/10.3390/s24196198

Journal: Sensors (Basel, Switzerland)
Publication Date: Sep 25, 2024
License type: CC BY 4.0

Abstract

The accurate extraction of buildings from remote sensing images is crucial in fields such as 3D urban planning, disaster detection, and military reconnaissance. In recent years, Transformer-based models have performed well at global information processing and contextual relationship modeling, but they suffer from high computational cost and a limited ability to capture local information. In contrast, convolutional neural networks (CNNs) are very effective at extracting local features but have a limited ability to process global information. This paper proposes an asymmetric network, CTANet, which combines the advantages of CNNs and Transformers to extract buildings efficiently. Specifically, CTANet employs ConvNeXt as the encoder to extract features and pairs it with an efficient bilateral hybrid attention transformer (BHAFormer), designed as the decoder. The BHAFormer establishes global dependencies from two perspectives, texture edge features and background information, to extract buildings more accurately while maintaining a low computational cost. Additionally, a multiscale mixed attention mechanism module (MSM-AMM) is introduced to learn the multiscale semantic information and channel representations of the encoder features, reducing noise interference and compensating for the information lost during downsampling. Experimental results show that, compared with other state-of-the-art methods, the proposed model achieves the best F1-score (86.7%, 95.74%, and 90.52%) and IoU (76.52%, 91.84%, and 82.68%) on the Massachusetts building dataset, the WHU building dataset, and the Inria aerial image labeling dataset, respectively.
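
The F1-score and IoU figures quoted above are the two standard overlap metrics for binary building masks. The abstract does not describe the evaluation protocol, so the following is only a minimal sketch, assuming both metrics are computed from pixel-wise true-positive, false-positive, and false-negative counts for the building class. The function names `f1_and_iou` and `f1_from_iou` are hypothetical and do not come from the paper.

```python
def f1_and_iou(pred, truth):
    """Pixel-wise F1 and IoU for the building class.

    pred, truth: 2D lists of 0/1 values with the same shape (assumed format).
    """
    tp = fp = fn = 0
    for pred_row, truth_row in zip(pred, truth):
        for p, t in zip(pred_row, truth_row):
            if p and t:
                tp += 1          # building pixel correctly predicted
            elif p and not t:
                fp += 1          # background predicted as building
            elif t and not p:
                fn += 1          # building pixel missed
    if tp + fp + fn == 0:
        # Assumed convention: no building pixels in either mask scores 0.
        return 0.0, 0.0
    iou = tp / (tp + fp + fn)
    f1 = 2 * tp / (2 * tp + fp + fn)
    return f1, iou


def f1_from_iou(iou):
    """When both metrics share the same counts, F1 = 2*IoU / (1 + IoU)."""
    return 2 * iou / (1 + iou)


if __name__ == "__main__":
    pred = [[1, 1, 0],
            [0, 1, 0]]
    truth = [[1, 0, 0],
             [0, 1, 1]]
    f1, iou = f1_and_iou(pred, truth)
    print(f"F1={f1:.4f}  IoU={iou:.4f}")

    # IoU values reported in the abstract, converted with the identity above.
    for reported_iou in (0.7652, 0.9184, 0.8268):
        print(f"IoU={reported_iou:.4f} -> F1={f1_from_iou(reported_iou):.4f}")
```

Under this assumption the two metrics are tied by F1 = 2·IoU / (1 + IoU). The reported pairs agree with that identity to within about 0.01 percentage points, which suggests, but does not confirm, that the paper computes both from the same counts.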

Similar Papers

Building Extraction from Remote Sensing Images with Sparse Token Transformers
Keyan Chen ... Zhengxia Zou
Remote Sensing | VOL. 13
05 Nov 2021

BOMSC-Net: Boundary Optimization and Multi-Scale Context Awareness Based Building Extraction From High-Resolution Remote Sensing Imagery
Yuan Zhou ... Zhanlong Chen
IEEE Transactions on Geoscience and Remote Sensing | VOL. 60
01 Jan 2021

MSL-Net: An Efficient Network for Building Extraction from Aerial Imagery
Yue Qiu ... Chengyi Liu
Remote Sensing | VOL. 14
12 Aug 2022

Multi-channel recurrent attention network for building extraction from high resolution remote sensing images
Zhen Wang ... Jianxin Guo
Measurement Science and Technology | VOL. 33
11 Feb 2022
