Radiological Interpretation Research Articles

Background: Thyroid nodules are challenging to characterize accurately on ultrasound (US), though the emergence of risk stratification systems and, more recently, artificial intelligence (AI) algorithms has improved nodule classification. The purpose of this study was to evaluate the performance of a recent Food and Drug Administration (FDA)-cleared AI tool for detection of malignancy in thyroid nodules on US. Methods: One year of consecutive thyroid US examinations with ≥1 nodule from Duke University Hospital and its affiliate community hospital (649 nodules from 347 patients) were retrospectively evaluated. Included nodules had ground truth diagnoses by surgical pathology, fine needle aspiration (FNA), or three-year follow-up US showing stability. An FDA-cleared AI tool (Koios DS Thyroid) analyzed each nodule to generate (i) American College of Radiology Thyroid Imaging Reporting and Data System (ACR TI-RADS) descriptors, scores, and follow-up recommendations and (ii) an AI-adapter score to further adjust risk assessments and recommendations. Four groups were then compared: (i) Koios with AI-adapter, (ii) Koios without AI-adapter, (iii) clinical radiology report, and (iv) radiology report combined with AI-adapter. Performance of the final recommendations (FNA or no FNA) was determined against ground truth, and the four groups were compared using sensitivity, specificity, and receiver operating characteristic (ROC) curve analysis. Results: Of 649 nodules, 32 were malignant and 617 were benign. Performance of Koios with AI-adapter enabled was similar to that of radiologists (area under the curve [AUC] 0.70 for both; CI: 0.60-0.81 and 0.60-0.79, respectively). Koios with AI-adapter had improved specificity compared to radiologists (0.63 [CI: 0.59-0.67] versus 0.43 [CI: 0.38-0.48]) but decreased sensitivity (0.69 [CI: 0.50-0.83] versus 0.81 [CI: 0.61-0.92]). The highest performance was seen when the radiology interpretation was combined with the AI-adapter (AUC 0.76 [CI: 0.67-0.85]). Combined with the AI-adapter, radiologist specificity improved from 0.43 (CI: 0.38-0.48) to 0.53 (CI: 0.49-0.58) (McNemar's test p < 0.001), resulting in 17% fewer FNA recommendations, with unchanged sensitivity (0.81, p = 1). Conclusion: Koios DS demonstrated standalone performance similar to that of radiologists, though with lower sensitivity and higher specificity. Performance was best when radiologist interpretations were combined with the AI-adapter component, with improved specificity and fewer unnecessary FNA recommendations.
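The sensitivity, specificity, and FNA-reduction figures in this abstract all follow from a 2×2 table of ground truth against the FNA recommendation. The sketch below shows that arithmetic only. The `Confusion` class and `relative_reduction` helper are hypothetical names, and the counts are illustrative assumptions: the abstract reports 32 malignant and 617 benign nodules but not the per-cell counts, so the values here were chosen simply so that the rounded rates agree with the reported ones.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Confusion:
    """Nodule counts by ground truth and FNA recommendation."""
    tp: int  # malignant, FNA recommended
    fn: int  # malignant, no FNA recommended
    tn: int  # benign, no FNA recommended
    fp: int  # benign, FNA recommended

    def sensitivity(self) -> float:
        # Fraction of malignant nodules that were sent to FNA.
        return self.tp / (self.tp + self.fn)

    def specificity(self) -> float:
        # Fraction of benign nodules that were spared FNA.
        return self.tn / (self.tn + self.fp)

    def fna_recommended(self) -> int:
        # Every positive call, correct or not, is an FNA recommendation.
        return self.tp + self.fp


def relative_reduction(before: Confusion, after: Confusion) -> float:
    """Fractional drop in FNA recommendations from `before` to `after`."""
    return 1 - after.fna_recommended() / before.fna_recommended()


# ASSUMED, illustrative counts (not reported in the abstract).
# Each row sums to 32 malignant and 617 benign nodules.
radiologist = Confusion(tp=26, fn=6, tn=263, fp=354)
combined = Confusion(tp=26, fn=6, tn=329, fp=288)

for name, c in [("radiologist", radiologist), ("radiologist + AI-adapter", combined)]:
    print(f"{name}: sensitivity={c.sensitivity():.2f}, specificity={c.specificity():.2f}")
print(f"relative reduction in FNA recommendations: {relative_reduction(radiologist, combined):.2f}")
```

With these assumed counts, sensitivity stays at 0.81 while specificity moves from 0.43 to 0.53, and FNA recommendations fall by about 17%. Because the malignant cases are unchanged, the whole reduction comes from fewer false-positive calls on benign nodules.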


Background & Aims: Hepatocellular carcinoma (HCC) is characterized by a high mortality rate. The Liver Imaging Reporting and Data System (LI-RADS) leaves a considerable proportion of observations indeterminate, making accurate diagnosis difficult. Methods: We developed four deep learning models for diagnosing HCC on computed tomography (CT) via a training-validation-testing approach. Thin-slice triphasic CT liver images and relevant clinical information were collected and processed for deep learning. HCC was diagnosed and verified via a 12-month clinical composite reference standard. CT observations among at-risk patients were annotated using LI-RADS. Diagnostic performance was assessed by internal validation and independent external testing. We conducted sensitivity analyses of different subgroups, deep learning explainability evaluation, and misclassification analysis. Results: From 2,832 patients and 4,305 CT observations, the best-performing model was the Spatio-Temporal 3D Convolution Network (ST3DCN), which achieved areas under the curve (AUCs) of 0.919 (95% CI 0.903-0.935) and 0.901 (95% CI 0.879-0.924) at the observation (n=1,077) and patient (n=685) levels respectively during internal validation, compared to 0.839 (95% CI 0.814-0.864) and 0.822 (95% CI 0.790-0.853) respectively for standard-of-care radiological interpretation. ST3DCN's negative predictive values were 0.966 (95% CI 0.954-0.979) and 0.951 (95% CI 0.931-0.971) respectively. ST3DCN's observation-level AUCs among at-risk patients, 2-5 cm observations, and single porto-venous phase analysis were 0.899 (95% CI 0.874-0.924), 0.872 (95% CI 0.838-0.909), and 0.912 (95% CI 0.895-0.929) respectively. In external testing (551 patients, 717 observations), ST3DCN's AUC was 0.901 (95% CI 0.877-0.924), non-inferior to radiological interpretation (AUC 0.900, 95% CI 0.877-0.923). Conclusions: ST3DCN achieved strong, robust performance for accurate HCC diagnosis on CT. Deep learning can expedite and improve the diagnostic process of HCC. Impact and implications: The clinical applicability of deep learning in HCC diagnosis is potentially large, especially given the expected increase in the incidence and mortality of HCC in Eastern Asia and worldwide. Early diagnosis through deep learning can lead to earlier definitive management, particularly for at-risk patients. The model can be broadly deployed for patients undergoing a triphasic contrast CT scan of the liver to reduce the current high mortality rate of HCC.
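Both abstracts summarize diagnostic performance with the AUC. As a reminder of what that number measures, the sketch below computes it as the probability that a randomly chosen positive case receives a higher score than a randomly chosen negative case, with ties counted as half. This is a generic illustration rather than either study's analysis code; the `auc` function and the scores are hypothetical toy values.

```python
from typing import Sequence


def auc(scores_pos: Sequence[float], scores_neg: Sequence[float]) -> float:
    """AUC as the fraction of (positive, negative) pairs ranked correctly.

    Ties between a positive and a negative score count as half a win.
    """
    if not scores_pos or not scores_neg:
        raise ValueError("need at least one positive and one negative score")
    wins = 0.0
    for p in scores_pos:
        for n in scores_neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(scores_pos) * len(scores_neg))


# Toy model scores (hypothetical, not taken from either study).
hcc_scores = [0.9, 0.8, 0.4]           # observations with confirmed HCC
non_hcc_scores = [0.7, 0.3, 0.2, 0.1]  # observations without HCC

print(f"AUC = {auc(hcc_scores, non_hcc_scores):.3f}")
```

Here 11 of the 12 positive-negative pairs are ordered correctly, so the AUC is 11/12. The pairwise loop is quadratic in the number of cases, which is fine for illustration; a rank-based computation would be the usual choice for larger datasets.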


Articles published on Radiological Interpretation

Celiac Trunk with a Replaced Right Hepatic Artery: A Rare Anatomical Variant

Machine learning applications in breast cancer prediction using mammography

Comparative Analysis of Large Language Models and Spine Surgeons in Surgical Decision-Making and Radiological Assessment for Spine Pathologies

Skeletal radiograph interpretation discrepancies in the emergency department setting: A retrospective chart review.

Development of a deep learning method to identify acute ischaemic stroke lesions on brain CT

Quantification of Interstitial Lung Diseases, From the AJR Special Series on Quantitative Imaging.

Early Ischemic Stroke Assessment with ASPECTS: A Case Report Highlighting the Radiologist's Role in a Limited-Resource Setting

Analysis of panel physician inquiries to U.S. TB Centers of Excellence, 2018-2022.

Gamma Dose Rate Measurements in Northern Spain: Influence of Local Meteorological Scenarios on Radiological "False Alarms" in a Real-Time Radiological Monitoring Network.

The Role of Artificial Intelligence in Diagnostic Radiology.

Simulating clinical features on chest radiographs for medical image exploration and CNN explainability using a style-based generative adversarial autoencoder

Revolution or risk?-Assessing the potential and challenges of GPT-4V in radiologic image interpretation.

Assessment of the Diagnostic Performance of a Commercially Available Artificial Intelligence Algorithm for Risk Stratification of Thyroid Nodules on Ultrasound.

Characteristics of the Reciprocal Movement of Radiographers’ Gaze Based on a Comparison of Entry-Level and Experienced Radiographers

Improvement of quality of life in women ≤ 25-years-old with chronic pelvic pain following stenting of nonthrombotic iliac vein compression.

The “Hungry Judge” effect on prostate MRI reporting: Chronobiological trends from 35’004 radiologist interpretations

Variability in chest radiology interpretation between thoracic and non-thoracic radiologists: Implications for pulmonary fibrosis care

Concordance and Discordance Between Radiology Residents and Consultant Radiologist Interpretation Of CT Brain

Role of teleradiology in the interpretation of ultrasound images acquired in the emergency setting

Application of a deep learning algorithm for the diagnosis of HCC

