Radiology Reports Research Articles

Large language models like GPT-4 have demonstrated potential for diagnosis in radiology. Previous studies investigating this potential primarily utilized quizzes from academic journals. This study aimed to assess the diagnostic capabilities of GPT-4-based Chat Generative Pre-trained Transformer (ChatGPT) using actual clinical radiology reports of brain tumors and compare its performance with that of neuroradiologists and general radiologists. We collected brain MRI reports written in Japanese from preoperative brain tumor patients at two institutions from January 2017 to December 2021. The MRI reports were translated into English by radiologists. GPT-4 and five radiologists were presented with the same textual findings from the reports and asked to suggest differential and final diagnoses. The pathological diagnosis of the excised tumor served as the ground truth. McNemar's test and Fisher's exact test were used for statistical analysis. In a study analyzing 150 radiological reports, GPT-4 achieved a final diagnostic accuracy of 73%, while radiologists' accuracy ranged from 65 to 79%. GPT-4's final diagnostic accuracy using reports from neuroradiologists was higher at 80%, compared to 60% using those from general radiologists. In the realm of differential diagnoses, GPT-4's accuracy was 94%, while radiologists' fell between 73 and 89%. Notably, for these differential diagnoses, GPT-4's accuracy remained consistent whether reports were from neuroradiologists or general radiologists. GPT-4 exhibited good diagnostic capability, comparable to neuroradiologists in differentiating brain tumors from MRI reports. GPT-4 can be a second opinion for neuroradiologists on final diagnoses and a guidance tool for general radiologists and residents. This study evaluated GPT-4-based ChatGPT's diagnostic capabilities using real-world clinical MRI reports from brain tumor cases, revealing that its accuracy in interpreting brain tumors from MRI findings is competitive with radiologists. We investigated the diagnostic accuracy of GPT-4 using real-world clinical MRI reports of brain tumors. GPT-4 achieved final and differential diagnostic accuracy that is comparable with neuroradiologists. GPT-4 has the potential to improve the diagnostic process in clinical radiology.

Read full abstract

Purpose of the study: To evaluate the performance of Best Practice Recommendation (BPR) compliance in reporting abdominal aortic aneurysm findings on imaging, comparing the results before and after its deployment. Methods: Best Practice Recommendations for AAA were deployed in 2020 at a large radiology practice site. Reports between January 2018 through October 2022 were reviewed, representing studies read prior to and subsequent to the implementation of the reporting standards. Cases of abdominal aortic aneurysms ≥ 2.6 cm were counted by year. Adherence to the BPR for each year was calculated as [total number of confirmed cases of ≥ 2.6 cm AAAs with compliant reports] * 100 / [the total number of confirmed ≥ 2.6 cm AAAs]. A secondary analysis was performed to determine whether there was a statistically significant difference in the proportion of BPR-compliant reports for AAA cases before (from 2018 to 2019) and after (from 2020 to 2022) BPR deployment using a chi-square test. Results: From January 2018 to December 2022, there were 8,693 reports referencing AAA. After excluding cases of suspected AAA (N = 2,131), confirmed AAAs with indeterminate sizes (N = 103), and confirmed AAAs with sizes < 2.6 cm (N = 85), the number of AAA cases ≥ 2.6 cm in size was 6,374. Concordance with the BPR standards for the remaining cases with sizes ≥ 2.6 cm were 1.6% and 4.1% in 2018 and 2019, respectively. Post-implementation of BPRs, there was a substantial improvement in guideline adherence to 32.1%, 84.3%, and 83.6% in 2020, 2021, and 2022, respectively. In general, the proportion of BPR-compliant reports of AAA cases in the pre-deployment (3.6%) period statistically differs (p - value < 0.0001) from those in the post-deployment period (73.9%) Conclusion: Adherence to reporting standards increased after the BPR deployment in 2020. The inclusion of management recommendations in the radiology report when AAA is identified is a simple and cost-effective way of improving outcomes for patients with AAAs through appropriate follow-up treatment.

Read full abstract

Radiology Reports Research Articles

Related Topics

Articles published on Radiology Reports

Interactive dual-stream contrastive learning for radiology report generation

Comparative analysis of GPT-4-based ChatGPT's diagnostic performance with radiologists using real-world radiology reports of brain tumors.

Use of reporting templates for chest radiographs in a coronavirus disease 2019 context: measuring concordance of radiologists with three international templates.

Precise Image-level Localization of Intracranial Hemorrhage on Head CT Scans with Deep Learning Models Trained on Study-level Labels.

Limited Utility of Dynamic Contrast Enhancement Imaging Sequences Within the PI-RADS v2.1 Classification Scheme: A Retrospective Cross-Sectional Study of MRI Reports.

Spleen anomalies and lesions in CT and MRI: essentials for radiologists and clinicians-a pictorial review.

The Fine-Tuned Large Language Model for Extracting the Progressive Bone Metastasis from Unstructured Radiology Reports.

Preliminary image evaluation performance of radiographers in one New Zealand District: a 6-month prospective study.

Establishment of a Best Practices in Reporting (BPR) Recommendation for Abdominal Aortic Aneurysms in a Large Multi-State Radiology Practice: Adoption and Impact

A vision-language foundation model for the generation of realistic chest X-ray images.

A scoping review of large language model based approaches for information extraction from radiology reports

Generating colloquial radiology reports with large language models.

Assessment of Follow-Up for Pulmonary Nodules from Radiology Reports with Natural Language Processing.

ChatGPT in Radiology: Insights into Current Advantages and Limitations of Artificial Intelligence in Radiology Reporting

A Novel Deep Learning Approach for the Automatic Diagnosis of Acute Appendicitis.

Staging peritoneal metastases in colorectal cancer: The correlation between MRI, surgical and histopathological peritoneal cancer index

Towards a Reporting Guideline for Studies on Information Extraction from Clinical Texts.

Collaborating with non-radiological clinical colleagues.

Preliminary assessment of TNM classification performance for pancreatic cancer in Japanese radiology reports using GPT-4.

DKA-RG: Disease-Knowledge-Enhanced Fine-Grained Image–Text Alignment for Automatic Radiology Report Generation

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Radiology Reports Research Articles

Related Topics

Articles published on Radiology Reports

Interactive dual-stream contrastive learning for radiology report generation

Comparative analysis of GPT-4-based ChatGPT's diagnostic performance with radiologists using real-world radiology reports of brain tumors.

Use of reporting templates for chest radiographs in a coronavirus disease 2019 context: measuring concordance of radiologists with three international templates.

Precise Image-level Localization of Intracranial Hemorrhage on Head CT Scans with Deep Learning Models Trained on Study-level Labels.

Limited Utility of Dynamic Contrast Enhancement Imaging Sequences Within the PI-RADS v2.1 Classification Scheme: A Retrospective Cross-Sectional Study of MRI Reports.

Spleen anomalies and lesions in CT and MRI: essentials for radiologists and clinicians-a pictorial review.

The Fine-Tuned Large Language Model for Extracting the Progressive Bone Metastasis from Unstructured Radiology Reports.

Preliminary image evaluation performance of radiographers in one New Zealand District: a 6-month prospective study.

Establishment of a Best Practices in Reporting (BPR) Recommendation for Abdominal Aortic Aneurysms in a Large Multi-State Radiology Practice: Adoption and Impact

A vision-language foundation model for the generation of realistic chest X-ray images.

A scoping review of large language model based approaches for information extraction from radiology reports

Generating colloquial radiology reports with large language models.

Assessment of Follow-Up for Pulmonary Nodules from Radiology Reports with Natural Language Processing.

ChatGPT in Radiology: Insights into Current Advantages and Limitations of Artificial Intelligence in Radiology Reporting

A Novel Deep Learning Approach for the Automatic Diagnosis of Acute Appendicitis.

Staging peritoneal metastases in colorectal cancer: The correlation between MRI, surgical and histopathological peritoneal cancer index

Towards a Reporting Guideline for Studies on Information Extraction from Clinical Texts.

Collaborating with non-radiological clinical colleagues.

Preliminary assessment of TNM classification performance for pancreatic cancer in Japanese radiology reports using GPT-4.

DKA-RG: Disease-Knowledge-Enhanced Fine-Grained Image–Text Alignment for Automatic Radiology Report Generation