The widespread deployment of deep neural networks (DNNs) in critical real-time applications has spurred significant research into their security and robustness. A key vulnerability is that DNN decisions can be maliciously altered by adding carefully crafted perturbations to the input data, causing erroneous predictions; such inputs are known as adversarial attacks. In this paper, we propose a novel detection framework that leverages segmentation masks and image segmentation techniques to identify adversarial attacks on DNNs, particularly in the context of autonomous driving systems. Our defense operates at two levels of adversarial detection. The first level primarily detects adversarial inputs with large perturbations using a U-net model and a one-class support vector machine (SVM). The second level introduces a dynamic segmentation algorithm based on k-means clustering together with a verifier model that controls the final prediction for the input image. To evaluate our approach, we comprehensively compare our method with the state-of-the-art feature squeezing method in a white-box attack setting, using eleven distinct adversarial attacks across three heterogeneous benchmark data sets. The experimental results demonstrate the efficacy of our framework, achieving overall detection rates exceeding 96% across all adversarial techniques and data sets studied. Notably, our method improves the detection of FGSM and BIM attacks, reaching an average detection rate of 95.65% across the three data sets, compared with 62.63% for feature squeezing.
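To illustrate the first detection level described above, the following is a minimal sketch, assuming that per-image statistics derived from U-net segmentation masks are summarized into fixed-length feature vectors and scored by a one-class SVM fitted only on clean inputs; the `unet_mask_features` helper, the placeholder features, and all numeric settings are hypothetical and not taken from the paper.

```python
# Minimal sketch (assumptions noted above): a one-class SVM trained on features
# of clean inputs flags out-of-distribution (potentially adversarial) inputs.
import numpy as np
from sklearn.svm import OneClassSVM

rng = np.random.default_rng(0)

def unet_mask_features(images):
    """Hypothetical stand-in for per-image statistics of U-net segmentation masks."""
    return rng.normal(loc=0.0, scale=1.0, size=(len(images), 16))

clean_train = list(range(200))                 # placeholder for clean training images
clean_feats = unet_mask_features(clean_train)  # features of clean inputs only

detector = OneClassSVM(kernel="rbf", nu=0.05, gamma="scale")
detector.fit(clean_feats)

# At inference time, +1 means "looks clean", -1 means "flag as adversarial".
test_feats = unet_mask_features(list(range(10)))
flags = detector.predict(test_feats)
print(flags)
```

In the full framework, inputs flagged here would proceed to the second level, where k-means-based dynamic segmentation and the verifier model control the final prediction.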