Semantic Segmentation of Aerial Imagery Using U-Net with Self-Attention and Separable Convolutions

Bakht Alam Khan, Jin-Woo Jung

doi:10.3390/app14093712

Abstract

This research addresses the task of improving accuracy in the semantic segmentation of aerial imagery, which is essential for applications such as urban planning and environmental monitoring. The study emphasizes the Intersection over Union (IoU) score as an evaluation metric and uses the Patchify library, with a patch size of 256, to augment the dataset, which is subsequently split into training and testing sets. The core of the investigation is a novel architecture that combines a U-Net framework with self-attention mechanisms and separable convolutions. The self-attention mechanisms enhance the model’s understanding of image context, while the separable convolutions speed up training, contributing to overall efficiency. The proposed model surpasses the previous state-of-the-art Dense Plus U-Net, achieving an accuracy of 91% compared to the former’s 86%. Visual comparisons of original image patches, original mask patches, and predicted mask patches illustrate the model’s segmentation quality and underscore the value of these architectural elements for accuracy and efficiency in aerial image analysis.
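
The following is a minimal sketch, not the authors' code, of three ideas the abstract mentions: cutting an image into fixed-size patches, scoring a predicted mask with mean IoU, and comparing the weight count of a standard convolution with that of a depthwise separable one. It uses plain Python lists instead of the Patchify library, and every function name here (`split_into_patches`, `mean_iou`, `conv_params`, `separable_conv_params`) is hypothetical. The 64-to-128-channel layer in the usage note is an illustrative assumption, not a layer from the paper.

```python
def split_into_patches(image, patch_size=256):
    """Cut a 2D grid (list of rows) into non-overlapping square patches.

    Rows and columns that do not fill a whole patch are dropped.
    """
    usable_rows = len(image) // patch_size * patch_size
    usable_cols = len(image[0]) // patch_size * patch_size
    patches = []
    for top in range(0, usable_rows, patch_size):
        for left in range(0, usable_cols, patch_size):
            patch = [row[left:left + patch_size]
                     for row in image[top:top + patch_size]]
            patches.append(patch)
    return patches


def mean_iou(true_mask, pred_mask, num_classes):
    """Mean Intersection over Union across classes present in either mask."""
    intersections = [0] * num_classes
    unions = [0] * num_classes
    for true_row, pred_row in zip(true_mask, pred_mask):
        for t, p in zip(true_row, pred_row):
            if t == p:
                # Pixel agrees: it is in both the intersection and the union.
                intersections[t] += 1
                unions[t] += 1
            else:
                # Pixel disagrees: it enlarges the union of both classes.
                unions[t] += 1
                unions[p] += 1
    scores = [i / u for i, u in zip(intersections, unions) if u > 0]
    return sum(scores) / len(scores) if scores else 0.0


def conv_params(in_channels, out_channels, kernel_size):
    """Weights in a standard 2D convolution (bias ignored)."""
    return kernel_size * kernel_size * in_channels * out_channels


def separable_conv_params(in_channels, out_channels, kernel_size):
    """Weights in a depthwise separable convolution (bias ignored):
    one k x k filter per input channel, then a 1 x 1 pointwise mix."""
    depthwise = kernel_size * kernel_size * in_channels
    pointwise = in_channels * out_channels
    return depthwise + pointwise
```

For example, under these assumptions a 3 x 3 layer mapping 64 channels to 128 needs `conv_params(64, 128, 3)` = 73728 weights as a standard convolution but only `separable_conv_params(64, 128, 3)` = 8768 as a separable one, which is the kind of saving that can shorten training.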

Journal: Applied Sciences
Publication Date: Apr 26, 2024
Citations: 1
License type: CC BY 4.0

Similar Papers

Semantic Segmentation Uncertainty Assessment of Different U-net Architectures for Extracting Building Footprints
Ehsan Haghighi Gashti ... Jonathan Li
ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences | VOL. X-4-2024
18 Oct 2024

A novel approach for semantic segmentation of automatic road network extractions from remote sensing images by modified UNet
Miral J Patel ... Hasmukh P Koringa
RADIOELECTRONIC AND COMPUTER SYSTEMS | VOL. -
04 Oct 2022

Generative Learning for Postprocessing Semantic Segmentation Predictions: A Lightweight Conditional Generative Adversarial Network Based on Pix2pix to Improve the Extraction of Road Surface Areas
Calimanut-Ionut Cira ... Miguel-Ángel Manso-Callejo
Land | VOL. 10
16 Jan 2021

Comparative Analysis of Different CNN Models for Building Segmentation from Satellite and UAV Images
Batuhan Sariturk ... Damla Kumbasar
Photogrammetric Engineering & Remote Sensing | VOL. 89
01 Feb 2023
