Abstract

Automated methods to extract buildings from very high resolution (VHR) remote sensing data have applications in a wide range of fields. Many convolutional neural network (CNN) based methods have been proposed and have achieved significant advances in the building extraction task. In order to refine predictions, many recent approaches fuse features from earlier layers of CNNs to introduce abundant spatial information, a strategy known as the skip connection. However, reusing earlier features directly, without processing, can reduce the performance of the network. To address this problem, we propose a novel fully convolutional network (FCN) that adopts attention-based re-weighting to extract buildings from aerial imagery. Specifically, we consider the semantic gap between features from different stages and leverage the attention mechanism to bridge the gap prior to the fusion of features. The inferred attention weights along the spatial and channel-wise dimensions make the low-level feature maps adaptive to the high-level feature maps in a target-oriented manner. Experimental results on three publicly available aerial imagery datasets show that the proposed model (RFA-UNet) achieves comparable or improved performance relative to other state-of-the-art models for building extraction.
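The re-weighting idea in the abstract can be sketched in a few lines. The following is a minimal numpy illustration of the general mechanism, not the authors' exact RFA-UNet module: function names and the simple pooled-sigmoid weights are illustrative assumptions, and the learned convolutions of a real attention block are omitted.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def channel_attention(high, low):
    # Global-average-pool the high-level features to one weight per
    # channel, squash to (0, 1), and re-weight the low-level map.
    w = sigmoid(high.mean(axis=(1, 2)))        # shape (C,)
    return low * w[:, None, None]

def spatial_attention(high, low):
    # Collapse the high-level channels to a single (H, W) saliency map
    # and use it to gate every channel of the low-level features.
    s = sigmoid(high.mean(axis=0))             # shape (H, W)
    return low * s[None, :, :]

def reweighted_fusion(high, low):
    # Bridge the semantic gap: re-weight the low-level features along
    # the channel and spatial dimensions, then fuse via concatenation
    # as a plain skip connection would.
    low = channel_attention(high, low)
    low = spatial_attention(high, low)
    return np.concatenate([high, low], axis=0)

# Toy feature maps: 8 channels over a 16x16 spatial grid, assuming the
# high-level map has already been upsampled to the low-level resolution.
rng = np.random.default_rng(0)
high = rng.standard_normal((8, 16, 16))
low = rng.standard_normal((8, 16, 16))
fused = reweighted_fusion(high, low)
print(fused.shape)  # (16, 16, 16)
```

The key point is that the low-level map is modulated by weights inferred from the high-level map before the two are concatenated, rather than being concatenated raw.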

Highlights

  • Automatic extraction of buildings from remote sensing imagery is of paramount importance in many application areas such as urban planning, population estimation, and disaster response [1]

  • We evaluated the effect of the proposed joint attention module in UNet for building extraction in very high resolution (VHR) images

  • Applying the attention mechanism to the UNet segmentation model, we observe that our joint attention module improves the performance of the existing architecture for the task of building extraction in VHR images


Summary

Introduction

Automatic extraction of buildings from remote sensing imagery is of paramount importance in many application areas such as urban planning, population estimation, and disaster response [1]. Assigning a semantic building class label to each pixel in very high resolution (VHR) imagery of urban areas is challenging because of high intra-class and low inter-class variability [2,3]: in high-resolution images the building category comprises man-made objects of many different sizes, while urban scenes contain growing amounts of clutter, such as the shadows of tall buildings and rooftops that resemble roads. Patch-based CNN methods [9,10,11,12,13] were initially adopted for prediction in dense urban areas; these patch-based CNNs label the center pixel by processing an image patch through a neural network. Although FCN-based methods can produce dense pixel-wise output directly, the pixel-wise classification derived from the final score map is quite coarse because of the sequential sub-sampling operations in the FCN.
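The patch-based labeling scheme described above can be sketched concretely. This is a hedged toy illustration, not any of the cited methods: the sliding-window loop and the stand-in classifier (a threshold on the patch mean, where a real method would use a trained CNN) are assumptions for demonstration only.

```python
import numpy as np

def dense_labels_from_patches(image, classify, patch=5):
    # Patch-based labeling: slide a window over the image and let the
    # classifier assign a class to each window's center pixel.
    r = patch // 2
    h, w = image.shape
    padded = np.pad(image, r, mode="reflect")  # handle image borders
    out = np.zeros((h, w), dtype=int)
    for i in range(h):
        for j in range(w):
            out[i, j] = classify(padded[i:i + patch, j:j + patch])
    return out

# Toy single-band image and a stand-in "classifier": call the center
# pixel a building (1) if the surrounding patch mean is positive.
rng = np.random.default_rng(1)
image = rng.standard_normal((8, 8))
labels = dense_labels_from_patches(image, lambda p: int(p.mean() > 0))
print(labels.shape)  # (8, 8)
```

Running one forward pass per pixel in this fashion is what makes patch-based prediction expensive, which motivates the dense, single-pass output of FCN-based methods mentioned above.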

