Local Image Features Research Articles

In recent years, the advancement of deep learning technology has led to excellent performance in synthetic aperture radar (SAR) automatic target recognition (ATR) technology. However, due to the interference of speckle noise, the task of classifying SAR images remains challenging. To address this issue, a multi-scale local–global feature fusion network (MFN) integrating a convolution neural network (CNN) and a transformer network was proposed in this study. The proposed network comprises three branches: a CovNeXt-SimAM branch, a Swin Transformer branch, and a multi-scale feature fusion branch. The CovNeXt-SimAM branch extracts local texture detail features of the SAR images at different scales. By incorporating the SimAM attention mechanism to the CNN block, the feature extraction capability of the model was enhanced from the perspective of spatial and channel attention. Additionally, the Swin Transformer branch was employed to extract SAR image global semantic information at different scales. Finally, the multi-scale feature fusion branch was used to fuse local features and global semantic information. Moreover, to overcome the problem of poor accuracy and inefficiency of the model due to empirically determined model hyperparameters, the Bayesian hyperparameter optimization algorithm was used to determine the optimal model hyperparameters. The model proposed in this study achieved average recognition accuracies of 99.26% and 94.27% for SAR vehicle targets under standard operating conditions (SOCs) and extended operating conditions (EOCs), respectively, on the MSTAR dataset. Compared with the baseline model, the recognition accuracy has been improved by 12.74% and 25.26%, respectively. The results demonstrated that Bayes-MFN reduces the inter-class distance of the SAR images, resulting in more compact classification features and less interference from speckle noise. Compared with other mainstream models, the Bayes-MFN model exhibited the best classification performance.

Read full abstract

Deep learning plays a highly essential role in the domain of remote sensing change detection (CD) due to its high efficiency. From some existing methods, we can observe that the fusion of information at each scale is quite vital for the accuracy of the CD results, especially for the common problems of pseudo-change and the difficult detection of change edges in the CD task. With this in mind, we propose a New Fusion network with Dual-branch Encoder and Triple-branch Decoder (DETDNet) that follows a codec structure as a whole, where the encoder adopts a siamese Res2Net-50 structure to extract the local features of the bitemporal images. As for the decoder in previous works, they usually employed a single branch, and this approach only preserved the fusion features of the encoder’s bitemporal images. Distinguished from these approaches, we adopt the triple-branch architecture in the decoder for the first time. The triple-branch structure preserves not only the dual-branch features from the encoder in the left and right branches, respectively, to learn the effective and powerful individual features of each temporal image but also the fusion features from the encoder in the middle branch. The middle branch utilizes triple-branch aggregation (TA) to realize the feature interaction of the three branches in the decoder, which enhances the integrated features and provides abundant and supplementary bitemporal feature information to improve the CD performance. The triple-branch architecture of the decoder ensures that the respective features of the bitemporal images as well as their fused features are preserved, making the feature extraction more integrated. In addition, the three branches employ a multiscale feature extraction module (MFE) per layer to extract multiscale contextual information and enhance the feature representation capability of the CD. We conducted comparison experiments on the BCDD, LEVIR-CD, and SYSU-CD datasets, which were created in New Zealand, the USA, and Hong Kong, respectively. The data were preprocessed to contain 7434, 10,192, and 20,000 image pairs, respectively. The experimental results show that DETDNet achieves F1 scores of 92.7%, 90.99%, and 81.13%, respectively, which shows better results compared to some recent works, which means that the model is more robust. In addition, the lower FP and FN indicate lower error and misdetection rates. Moreover, from the analysis of the experimental results, compared with some existing methods, the problem of pseudo-changes and the difficulty of detecting small change areas is better solved.

Read full abstract

Local Image Features Research Articles

Related Topics

Articles published on Local Image Features

Non-Contact Measurement of Pregnant Sows’ Backfat Thickness Based on a Hybrid CNN-ViT Model

A rotation-invariant corner detector based on the median of subpixelized triangle

AEA-Net:Affinity-supervised entanglement attentive network for person re-identification

Design of a Robust Hybrid Fuzzy Method for Medical Image Fusion

Implementation of Augmented Reality at Interactive Food Menu Using the Speed Up Robust Features (SURF) Algorithm

ESPT: A Self-Supervised Episodic Spatial Pretext Task for Improving Few-Shot Learning

Context-aware lightweight remote-sensing image super-resolution network.

Dynamic Weighting Network for Person Re-Identification.

A Multiscale Local–Global Feature Fusion Method for SAR Image Classification with Bayesian Hyperparameter Optimization Algorithm

Visual Image Decoding of Brain Activities Using a Dual Attention Hierarchical Latent Generative Network With Multiscale Feature Fusion

Perception-Oriented U-Shaped Transformer Network for 360-Degree No-Reference Image Quality Assessment

High-Quality Multispectral Image Reconstruction for the Spectral Camera Based on Ghost Imaging via Sparsity Constraints Using CoT-Unet

Semi-Dense Feature Matching with Transformers and its Applications in Multiple-View Geometry.

An improved checkerboard detection algorithm based on adaptive filters

CICHMKG: a large-scale and comprehensive Chinese intangible cultural heritage multimodal knowledge graph

HIGF-Net: Hierarchical information-guided fusion network for polyp segmentation based on transformer and convolution feature learning

New Fusion Network with Dual-Branch Encoder and Triple-Branch Decoder for Remote Sensing Image Change Detection

A CNN-Based Layer-Adaptive GCPs Extraction Method for TIR Remote Sensing Images

An End-to-End Framework Based on Vision-Language Fusion for Remote Sensing Cross-Modal Text-Image Retrieval

MLP-based classification of COVID-19 and skin diseases

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Local Image Features Research Articles

Related Topics

Articles published on Local Image Features

Non-Contact Measurement of Pregnant Sows’ Backfat Thickness Based on a Hybrid CNN-ViT Model

A rotation-invariant corner detector based on the median of subpixelized triangle

AEA-Net:Affinity-supervised entanglement attentive network for person re-identification

Design of a Robust Hybrid Fuzzy Method for Medical Image Fusion

Implementation of Augmented Reality at Interactive Food Menu Using the Speed Up Robust Features (SURF) Algorithm

ESPT: A Self-Supervised Episodic Spatial Pretext Task for Improving Few-Shot Learning

Context-aware lightweight remote-sensing image super-resolution network.

Dynamic Weighting Network for Person Re-Identification.

A Multiscale Local–Global Feature Fusion Method for SAR Image Classification with Bayesian Hyperparameter Optimization Algorithm

Visual Image Decoding of Brain Activities Using a Dual Attention Hierarchical Latent Generative Network With Multiscale Feature Fusion

Perception-Oriented U-Shaped Transformer Network for 360-Degree No-Reference Image Quality Assessment

High-Quality Multispectral Image Reconstruction for the Spectral Camera Based on Ghost Imaging via Sparsity Constraints Using CoT-Unet

Semi-Dense Feature Matching with Transformers and its Applications in Multiple-View Geometry.

An improved checkerboard detection algorithm based on adaptive filters

CICHMKG: a large-scale and comprehensive Chinese intangible cultural heritage multimodal knowledge graph

HIGF-Net: Hierarchical information-guided fusion network for polyp segmentation based on transformer and convolution feature learning

New Fusion Network with Dual-Branch Encoder and Triple-Branch Decoder for Remote Sensing Image Change Detection

A CNN-Based Layer-Adaptive GCPs Extraction Method for TIR Remote Sensing Images

An End-to-End Framework Based on Vision-Language Fusion for Remote Sensing Cross-Modal Text-Image Retrieval

MLP-based classification of COVID-19 and skin diseases