Abstract

Higher resolution building mapping from lower resolution remote sensing images is in great demand due to the lack of higher resolution data access, especially in the context of disaster assessment. High resolution building layout map is crucial for emergency rescue after the disaster. The emergency response time would be reduced if detailed building footprints were delineated from more easily available low-resolution data. To achieve this goal, we propose a super-resolution semantic segmentation network called ESPC_NASUnet, which consists of a feature super-resolution module and a semantic segmentation module. To the best of authors’ knowledge, this is the first work to systematically explore a deep learning-based approach to generate semantic maps with higher spatial resolution from lower spatial resolution remote sensing images in an end-to-end fashion. The experimental results for two datasets suggest that the proposed network is the best among four different end-to-end architectures in terms of both pixel-level metrics and object-level metrics. In terms of pixel-level $F$ 1-score, the improvements are greater than 0.068 and 0.055. Regarding the object-level $F$ 1-score, the disparities between ESPC_NASUnet and other end-to-end methods are more than 0.083 and 0.161 in the two datasets, respectively. Compared with stage-wise methods, our end-to-end network is less impacted by low-resolution input images. Finally, the proposed network produces building semantic maps comparable to those generated by semantic segmentation networks trained with high-resolution images and the ground truth utilizing the two datasets.

Highlights

  • R EMOTE sensing image interpretation is an important way to delineate buildings for urban planning

  • 1) We propose an effective end-to-end super-resolution semantic segmentation (SRSS) network, i.e., ESPC_NASUnet, to extract HR building maps from LR remote sensing images, in whose training phase HR images are not in need

  • A positive number indicates a higher value of the metric than that of ESPC_NASUnet

Read more

Summary

Introduction

R EMOTE sensing image interpretation is an important way to delineate buildings for urban planning. The poor efficiency and time-consuming nature of artificial interpretation have made automatic and semiautomatic building extraction. With the development of remote sensing imaging technologies, the spatial resolution of acquired data continues to improve. Building footprints extracted from remote sensing images are becoming more detailed. In some emergencies with time limitations such as disaster assessment, individual buildings need to be delineated [7] as quickly as possible. Some data with lower spatial resolution are open access. If these data could be utilized to produce semantic maps of buildings, the difficulty of HR data acquisition could be avoided

Methods
Results
Discussion
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.