BSDSNet: Dual-Stream Feature Extraction Network Based on Segment Anything Model for Synthetic Aperture Radar Land Cover Classification

Yangyang Wang,Weidong Chen,Chang Chen,Wengang Zhang

doi:10.3390/rs16071150

Abstract

Land cover classification using high-resolution Polarimetric Synthetic Aperture Radar (PolSAR) images obtained from satellites is a challenging task. While deep learning algorithms have been extensively studied for PolSAR image land cover classification, the performance is severely constrained due to the scarcity of labeled PolSAR samples and the limited domain acceptance of models. Recently, the emergence of the Segment Anything Model (SAM) based on the vision transformer (VIT) model has brought about a revolution in the study of specific downstream tasks in computer vision. Benefiting from its millions of parameters and extensive training datasets, SAM demonstrates powerful capabilities in extracting semantic information and generalization. To this end, we propose a dual-stream feature extraction network based on SAM, i.e., BSDSNet. We change the image encoder part of SAM to a dual stream, where the ConvNext image encoder is utilized to extract local information and the VIT image encoder is used to extract global information. BSDSNet achieves an in-depth exploration of semantic and spatial information in PolSAR images. Additionally, to facilitate a fine-grained amalgamation of information, the SA-Gate module is employed to integrate local–global information. Compared to previous deep learning models, BSDSNet’s impressive ability to represent features is akin to a versatile receptive field, making it well suited for classifying PolSAR images across various resolutions. Comprehensive evaluations indicate that BSDSNet achieves excellent results in qualitative and quantitative evaluation when performing classification tasks on the AIR-PolSAR-Seg dataset and the WHU-OPT-SAR dataset. Compared to the suboptimal results, our method improves the Kappa metric by 3.68% and 0.44% on the AIR-PolSAR-Seg dataset and the WHU-OPT-SAR dataset, respectively.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Remote Sensing	Publication Date: Mar 26, 2024
Citations: 1	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

BSDSNet: Dual-Stream Feature Extraction Network Based on Segment Anything Model for Synthetic Aperture Radar Land Cover Classification

Abstract

Talk to us

Similar Papers

More From: Remote Sensing

Lead the way for us

Similar Papers

Exploring the Performance of Different Texture Information and Polarization Features from PolSAR Images in Urban Land Cover Classification
Songjing Guo ... Qimin Cheng
Photogrammetric Engineering & Remote Sensing | VOL. 87
Songjing Guo, et. al.Songjing Guo ... Qimin Cheng
01 Feb 2021
Photogrammetric Engineering & Remote Sensing | VOL. 87

Multi-Feature Segmentation for High-Resolution Polarimetric SAR Data Based on Fractal Net Evolution Approach
Qihao Chen ... Xiuguo Liu
Remote Sensing | VOL. 9
Qihao Chen, et. al.Qihao Chen ... Xiuguo Liu
06 Jun 2017
Remote Sensing | VOL. 9

A level set method for segmentation of high-resolution polarimetric SAR images using a heterogeneous clutter model
Pengfei Zou ... Lijie Guo
Remote Sensing Letters | VOL. 6
Pengfei Zou, et. al.Pengfei Zou ... Lijie Guo
25 Jun 2015
Remote Sensing Letters | VOL. 6

Superpixel Segmentation for PolSAR Images Based on Geodesic Distance
...
-
, et. al. ...
25 Feb 2021
25 Feb 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

BSDSNet: Dual-Stream Feature Extraction Network Based on Segment Anything Model for Synthetic Aperture Radar Land Cover Classification

Abstract

Talk to us

Similar Papers

More From: Remote Sensing