SSIM Metrics Research Articles

Before 2008, China lacked high-coverage regional surface observation data, making it difficult for the China Meteorological Administration Land Data Assimilation System (CLDAS) to directly backtrack high-resolution, high-quality land assimilation products. To address this issue, this paper proposes a deep learning model named UNET_DCA, based on the UNET architecture, which incorporates a Dual Cross-Attention module (DCA) for multiscale feature fusion by introducing Channel Cross-Attention (CCA) and Spatial Cross-Attention (SCA) mechanisms. This model focuses on the near-surface 10-m wind field and achieves spatial downscaling from 6.25 km to 1 km. We conducted training and validation using data from 2020–2021, tested with data from 2019, and performed ablation experiments to validate the effectiveness of each module. We compared the results with traditional bilinear interpolation methods and the SNCA-CLDASSD model. The experimental results show that the UNET-based model outperforms SNCA-CLDASSD, indicating that the UNET-based model captures richer information in wind field downscaling compared to SNCA-CLDASSD, which relies on sequentially stacked CNN convolution modules. UNET_CCA and UNET_SCA, incorporating cross-attention mechanisms, outperform UNET without attention mechanisms. Furthermore, UNET_DCA, incorporating both Channel Cross-Attention and Spatial Cross-Attention mechanisms, outperforms UNET_CCA and UNET_SCA, which only incorporate one attention mechanism. UNET_DCA performs best on the RMSE, MAE, and COR metrics (0.40 m/s, 0.28 m/s, 0.93), while UNET_DCA_ars, incorporating more auxiliary information, performs best on the PSNR and SSIM metrics (29.006, 0.880). Evaluation across different methods indicates that the optimal model performs best in valleys, followed by mountains, and worst in plains; it performs worse during the day and better at night; and as wind speed levels increase, accuracy decreases. Overall, among various downscaling methods, UNET_DCA and UNET_DCA_ars effectively reconstruct the spatial details of wind fields, providing a deeper exploration for the inversion of high-resolution historical meteorological grid data.

Read full abstract

PurposeAccurate deformable registration of magnetic resonance imaging (MRI) scans containing pathologies is challenging due to changes in tissue appearance. In this paper, we developed a novel automated three-dimensional (3D) convolutional U-Net based deformable image registration (ConvUNet-DIR) method using unsupervised learning to establish correspondence between baseline pre-operative and follow-up MRI scans of patients with brain glioma.MethodsThis study involved multi-parametric brain MRI scans (T1, T1-contrast enhanced, T2, FLAIR) acquired at pre-operative and follow-up time for 160 patients diagnosed with glioma, representing the BraTS-Reg 2022 challenge dataset. ConvUNet-DIR, a deep learning-based deformable registration workflow using 3D U-Net style architecture as a core, was developed to establish correspondence between the MRI scans. The workflow consists of three components: (1) the U-Net learns features from pairs of MRI scans and estimates a mapping between them, (2) the grid generator computes the sampling grid based on the derived transformation parameters, and (3) the spatial transformation layer generates a warped image by applying the sampling operation using interpolation. A similarity measure was used as a loss function for the network with a regularization parameter limiting the deformation. The model was trained via unsupervised learning using pairs of MRI scans on a training data set (n = 102) and validated on a validation data set (n = 26) to assess its generalizability. Its performance was evaluated on a test set (n = 32) by computing the Dice score and structural similarity index (SSIM) quantitative metrics. The model’s performance also was compared with the baseline state-of-the-art VoxelMorph (VM1 and VM2) learning-based algorithms.ResultsThe ConvUNet-DIR model showed promising competency in performing accurate 3D deformable registration. It achieved a mean Dice score of 0.975 ± 0.003 and SSIM of 0.908 ± 0.011 on the test set (n = 32). Experimental results also demonstrated that ConvUNet-DIR outperformed the VoxelMorph algorithms concerning Dice (VM1: 0.969 ± 0.006 and VM2: 0.957 ± 0.008) and SSIM (VM1: 0.893 ± 0.012 and VM2: 0.857 ± 0.017) metrics. The time required to perform a registration for a pair of MRI scans is about 1 s on the CPU.ConclusionsThe developed deep learning-based model can perform an end-to-end deformable registration of a pair of 3D MRI scans for glioma patients without human intervention. The model could provide accurate, efficient, and robust deformable registration without needing pre-alignment and labeling. It outperformed the state-of-the-art VoxelMorph learning-based deformable registration algorithms and other supervised/unsupervised deep learning-based methods reported in the literature.

Read full abstract

SSIM Metrics Research Articles

Related Topics

Articles published on SSIM Metrics

Joint Optimization-Based Texture and Geometry Enhancement Method for Single-Image-Based 3D Content Creation

A Deep Learning-Based Two-Branch Generative Adversarial Network for Image De-Raining

Robust image tamper detection and recovery with self-embedding watermarking using SPIHT and LDPC

Enhancing Steganography in 256×256 Colored Images with U-Net: A Study on PSNR and SSIM Metrics with Variable-Sized Hidden Images

Machine Learning for Pedestrian-Level Wind Comfort Analysis

Enhanced Wind Field Spatial Downscaling Method Using UNET Architecture and Dual Cross-Attention Mechanism

Deformable registration of magnetic resonance images using unsupervised deep learning in neuro-/radiation oncology

Non-local sparse attention based swin transformer V2 for image super-resolution

Building Edge Detection Technology From Remote Sensing Image Based On NSCT And Tensor Voting

CGIHE-VDSR: Color global image histogram equalization with very deep super resolution networks for color image super resolution

MMDCP: An Image Enhancement Algorithm Incorporating Multi-Channel Phase Activation and Multi-Constrained Dark Channel Prior

Edge Boosted Global Awared Low-light Image Enhancement Network

DSG-GAN:A dual-stage-generator-based GAN for cross-modality synthesis from PET to CT

Colorful image reconstruction from neuromorphic event cameras with biologically inspired deep color fusion neural networks.

StainSWIN: Vision transformer-based stain normalization for histopathology image analysis

Neural radiance fields-based multi-view endoscopic scene reconstruction for surgical simulation.

HATF: Multi-Modal Feature Learning for Infrared and Visible Image Fusion via Hybrid Attention Transformer

FDSR: An Interpretable Frequency Division Stepwise Process Based Single-Image Super-Resolution Network.

Image Enhancement Network Architecture for Multidimensional Fusion of Medical Imaging Data under Intense Light Interference

A Cloud Coverage Image Reconstruction Approach for Remote Sensing of Temperature and Vegetation in Amazon Rainforest

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

SSIM Metrics Research Articles

Related Topics

Articles published on SSIM Metrics

Joint Optimization-Based Texture and Geometry Enhancement Method for Single-Image-Based 3D Content Creation

A Deep Learning-Based Two-Branch Generative Adversarial Network for Image De-Raining

Robust image tamper detection and recovery with self-embedding watermarking using SPIHT and LDPC

Enhancing Steganography in 256×256 Colored Images with U-Net: A Study on PSNR and SSIM Metrics with Variable-Sized Hidden Images

Machine Learning for Pedestrian-Level Wind Comfort Analysis

Enhanced Wind Field Spatial Downscaling Method Using UNET Architecture and Dual Cross-Attention Mechanism

Deformable registration of magnetic resonance images using unsupervised deep learning in neuro-/radiation oncology

Non-local sparse attention based swin transformer V2 for image super-resolution

Building Edge Detection Technology From Remote Sensing Image Based On NSCT And Tensor Voting

CGIHE-VDSR: Color global image histogram equalization with very deep super resolution networks for color image super resolution

MMDCP: An Image Enhancement Algorithm Incorporating Multi-Channel Phase Activation and Multi-Constrained Dark Channel Prior

Edge Boosted Global Awared Low-light Image Enhancement Network

DSG-GAN:A dual-stage-generator-based GAN for cross-modality synthesis from PET to CT

Colorful image reconstruction from neuromorphic event cameras with biologically inspired deep color fusion neural networks.

StainSWIN: Vision transformer-based stain normalization for histopathology image analysis

Neural radiance fields-based multi-view endoscopic scene reconstruction for surgical simulation.

HATF: Multi-Modal Feature Learning for Infrared and Visible Image Fusion via Hybrid Attention Transformer

FDSR: An Interpretable Frequency Division Stepwise Process Based Single-Image Super-Resolution Network.

Image Enhancement Network Architecture for Multidimensional Fusion of Medical Imaging Data under Intense Light Interference

A Cloud Coverage Image Reconstruction Approach for Remote Sensing of Temperature and Vegetation in Amazon Rainforest