Computed Tomography Datasets Research Articles

Over 85 million computed tomography (CT) scans are performed annually in the US, of which approximately one quarter focus on the abdomen. Given the current shortage of both general and specialized radiologists, there is a large impetus to use artificial intelligence to alleviate the burden of interpreting these complex imaging studies while simultaneously using the images to extract novel physiological insights. Prior state-of-the-art approaches for automated medical image interpretation leverage vision language models (VLMs) that utilize both the image and the corresponding textual radiology reports. However, current medical VLMs are generally limited to 2D images and short reports. To overcome these shortcomings for abdominal CT interpretation, we introduce Merlin - a 3D VLM that leverages both structured electronic health records (EHR) and unstructured radiology reports for pretraining without requiring additional manual annotations. We train Merlin using a high-quality clinical dataset of paired CT scans (6+ million images from 15,331 CTs), EHR diagnosis codes (1.8+ million codes), and radiology reports (6+ million tokens) for training. We comprehensively evaluate Merlin on 6 task types and 752 individual tasks. The non-adapted (off-the-shelf) tasks include zero-shot findings classification (31 findings), phenotype classification (692 phenotypes), and zero-shot cross-modal retrieval (image to findings and image to impressions), while model adapted tasks include 5-year chronic disease prediction (6 diseases), radiology report generation, and 3D semantic segmentation (20 organs). We perform internal validation on a test set of 5,137 CTs, and external validation on 7,000 clinical CTs and on two public CT datasets (VerSe, TotalSegmentator). Beyond these clinically-relevant evaluations, we assess the efficacy of various network architectures and training strategies to depict that Merlin has favorable performance to existing task-specific baselines. We derive data scaling laws to empirically assess training data needs for requisite downstream task performance. Furthermore, unlike conventional VLMs that require hundreds of GPUs for training, we perform all training on a single GPU. This computationally efficient design can help democratize foundation model training, especially for health systems with compute constraints. We plan to release our trained models, code, and dataset, pending manual removal of all protected health information.

Read full abstract

Brain computed tomography (CT) is an accessible and commonly utilized technique for assessing brain structure. In cases of idiopathic normal pressure hydrocephalus (iNPH), the presence of ventriculomegaly is often neuroradiologically evaluated by visual rating and manually measuring each image. Previously, we have developed and tested a deep-learning-model that utilizes transfer learning from magnetic resonance imaging (MRI) for CT-based intracranial tissue segmentation. Accordingly, herein we aimed to enhance the segmentation of ventricular cerebrospinal fluid (VCSF) in brain CT scans and assess the performance of automated brain CT volumetrics in iNPH patient diagnostics. The development of the model used a two-stage approach. Initially, a 2D U-Net model was trained to predict VCSF segmentations from CT scans, using paired MR-VCSF labels from healthy controls. This model was subsequently refined by incorporating manually segmented lateral CT-VCSF labels from iNPH patients, building on the features learned from the initial U-Net model. The training dataset included 734 CT datasets from healthy controls paired with T1-weighted MRI scans from the Gothenburg H70 Birth Cohort Studies and 62 CT scans from iNPH patients at Uppsala University Hospital. To validate the model's performance across diverse patient populations, external clinical images including scans of 11 iNPH patients from the Universitatsmedizin Rostock, Germany, and 30 iNPH patients from the University of Alabama at Birmingham, United States were used. Further, we obtained three CT-based volumetric measures (CTVMs) related to iNPH. Our analyses demonstrated strong volumetric correlations (ϱ=0.91, p<0.001) between automatically and manually derived CT-VCSF measurements in iNPH patients. The CTVMs exhibited high accuracy in differentiating iNPH patients from controls in external clinical datasets with an AUC of 0.97 and in the Uppsala University Hospital datasets with an AUC of 0.99. CTVMs derived through deep learning, show potential for assessing and quantifying morphological features in hydrocephalus. Critically, these measures performed comparably to gold-standard neuroradiology assessments in distinguishing iNPH from healthy controls, even in the presence of intraventricular shunt catheters. Accordingly, such an approach may serve to improve the radiological evaluation of iNPH diagnosis/monitoring (i.e., treatment responses). Since CT is much more widely available than MRI, our results have considerable clinical impact.

Read full abstract

Computed Tomography Datasets Research Articles

Related Topics

Articles published on Computed Tomography Datasets

Merlin: A Vision Language Foundation Model for 3D Computed Tomography.

Assessing CT-based Volumetric Analysis via Transfer Learning with MRI and Manual Labels for Idiopathic Normal Pressure Hydrocephalus.

An emerging network for COVID-19 CT-scan classification using an ensemble deep transfer learning model

MedYOLO: A Medical Image Object Detection Framework.

Orthognathic surgery improves compromised natural head position and pharyngeal airway in patients with Skeletal Class II or III malocclusion.

A landmark-supervised registration framework for multi-phase CT images with cross-distillation

Three-Dimensional Segmentation of Equine Paranasal Sinuses in Multidetector Computed Tomography Datasets: Preliminary Morphometric Assessment Assisted with Clustering Analysis.

Evaluation of an Artificial Intelligence Model for Identification of Intracranial Hemorrhage Subtypes on Computed Tomography of the Head

WaveletDFDS-Net: A Dual Forward Denoising Stream Network for Low-Dose CT Noise Reduction

Noninvasive Atherosclerotic Phenotyping: The Next Frontier into Understanding the Pathobiology of Coronary Artery Disease.

Influence of learned landmark correspondences on lung CT registration.

Refining computer tomography data with super-resolution networks to increase the accuracy of respiratory flow simulations

Treatment duration by morphology and location of impacted maxillary canines: A cone-beam computed tomography investigation

MIB-Net: Balance the mutual information flow in deep learning network for multi-dimensional segmentation of COVID-19 CT images

Prediction of recurrence-free survival in lung adenocarcinoma based on self-supervised pre-training and multi-task learning

High throughput automated characterization of enamel microstructure using synchrotron tomography and optical flow imaging

Wasserstein-GAN-based noise reduction for preserving anatomical structures in low-dose CT

Exploration of postural effects on the external jugular and diploic venous system using upright computed tomography scanning.

Recurrent feature propagation and edge skip-connections for automatic abdominal organ segmentation

Noise aware content-noise complementary GAN with local and global discrimination for low-dose CT denoising

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Computed Tomography Datasets Research Articles

Related Topics

Articles published on Computed Tomography Datasets

Merlin: A Vision Language Foundation Model for 3D Computed Tomography.

Assessing CT-based Volumetric Analysis via Transfer Learning with MRI and Manual Labels for Idiopathic Normal Pressure Hydrocephalus.

An emerging network for COVID-19 CT-scan classification using an ensemble deep transfer learning model

MedYOLO: A Medical Image Object Detection Framework.

Orthognathic surgery improves compromised natural head position and pharyngeal airway in patients with Skeletal Class II or III malocclusion.

A landmark-supervised registration framework for multi-phase CT images with cross-distillation

Three-Dimensional Segmentation of Equine Paranasal Sinuses in Multidetector Computed Tomography Datasets: Preliminary Morphometric Assessment Assisted with Clustering Analysis.

Evaluation of an Artificial Intelligence Model for Identification of Intracranial Hemorrhage Subtypes on Computed Tomography of the Head

WaveletDFDS-Net: A Dual Forward Denoising Stream Network for Low-Dose CT Noise Reduction

Noninvasive Atherosclerotic Phenotyping: The Next Frontier into Understanding the Pathobiology of Coronary Artery Disease.

Influence of learned landmark correspondences on lung CT registration.

Refining computer tomography data with super-resolution networks to increase the accuracy of respiratory flow simulations

Treatment duration by morphology and location of impacted maxillary canines: A cone-beam computed tomography investigation

MIB-Net: Balance the mutual information flow in deep learning network for multi-dimensional segmentation of COVID-19 CT images

Prediction of recurrence-free survival in lung adenocarcinoma based on self-supervised pre-training and multi-task learning

High throughput automated characterization of enamel microstructure using synchrotron tomography and optical flow imaging

Wasserstein-GAN-based noise reduction for preserving anatomical structures in low-dose CT

Exploration of postural effects on the external jugular and diploic venous system using upright computed tomography scanning.

Recurrent feature propagation and edge skip-connections for automatic abdominal organ segmentation

Noise aware content-noise complementary GAN with local and global discrimination for low-dose CT denoising