MSFTrans: a multi-task frequency-spatial learning transformer for building extraction from high spatial resolution remote sensing images

Bo Yu, Fang Chen, Ning Wang, Lu Yang, Haiping Yang, Lei Wang

doi:10.1080/15481603.2022.2143678

Abstract

Building extraction is important for urban planning, economic evaluation, and the development of driverless technology. However, automatic building extraction from high spatial resolution remote sensing images remains challenging because of the variety of building shapes and colors, imaging conditions, and complex background objects. Current building extraction methods are generally based on deep convolutional networks and mostly use an encoder-decoder architecture, in which detailed building features and small buildings are easily lost through successive convolution operations. Moreover, buildings with blurred boundaries are difficult to extract completely. To meet these challenges, we propose a multi-task frequency-spatial learning Transformer architecture for extracting buildings from high spatial resolution remote sensing images. Unlike current architectures, it includes a frequency-spatial learning module, designed within the multi-task framework, that synthesizes the multi-scale spatial features and frequency decomposition features of high-resolution images. We propose spiking convolution to enhance the frequency features of buildings by mimicking neural transmission in the human brain. In this way, multi-scale building features are better preserved and distinguished from background objects. In addition, a masked-attention Transformer is adopted to improve the accuracy of multi-scale building mask prediction by synthesizing successive pixel-wise up-sampled feature maps. We also propose a strategy for evaluating the practical transferability of the method: it mimics practical application cases by training and evaluating on images with different spatial resolutions from different study areas and datasets.

Experiments on five public building datasets (WHU-Building Satellite Dataset I, WHU-Building Satellite Dataset II, Massachusetts Buildings Dataset, Inria Aerial Image Dataset, xBD Building Dataset) demonstrate the strong potential applicability of the proposed method in practical application cases. Our method outperforms five recently proposed state-of-the-art semantic segmentation methods, with a 36.60% accuracy improvement on extracted buildings and an approximately 53.55% improvement in recall for small building instances. The implementation code will be released after the paper is published.
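
The abstract reports gains in accuracy and recall but does not define the exact measures used. As a rough illustration only, the sketch below computes three pixel-wise scores that are commonly used for binary building masks: precision, recall, and intersection over union (IoU). The function name `mask_metrics` and the toy masks are hypothetical and are not taken from the paper or its code.

```python
def mask_metrics(pred, truth):
    """Pixel-wise precision, recall, and IoU for two binary masks.

    Both masks are nested lists of 0/1 with the same shape
    (1 = building pixel, 0 = background). Illustrative only; the
    paper's own evaluation protocol may differ.
    """
    tp = fp = fn = 0
    for pred_row, truth_row in zip(pred, truth):
        for p, t in zip(pred_row, truth_row):
            if p and t:
                tp += 1  # building pixel correctly predicted
            elif p and not t:
                fp += 1  # background predicted as building
            elif t and not p:
                fn += 1  # building pixel that was missed

    # Guard against empty masks to avoid division by zero.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    iou = tp / (tp + fp + fn) if (tp + fp + fn) else 0.0
    return {"precision": precision, "recall": recall, "iou": iou}


# Toy example: one 2x2 building and one single-pixel "small building".
truth = [
    [1, 1, 0, 0],
    [1, 1, 0, 0],
    [0, 0, 0, 1],
]
pred = [
    [1, 1, 0, 0],
    [1, 0, 0, 0],
    [0, 0, 1, 0],
]

scores = mask_metrics(pred, truth)
print(scores)
```

In this toy case the single-pixel building is missed entirely, which lowers recall. That is the kind of omission of small buildings that the abstract describes.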

Journal: GIScience & Remote Sensing	Publication Date: Nov 16, 2022
Citations: 9	License type: open-access
