Deep-learning-assisted diagnosis for knee magnetic resonance imaging: Development and retrospective validation of MRNet.

Nicholas Bien,Bhavik N Patel,Matthew P Lungren,Michael Bereket,Erik Jones,David B Larson,Andrew Y Ng,Curtis P Langlotz,Kristen W Yeom,Safwan Halabi,Katie Shpanskaya,Pranav Rajpurkar,Christopher F Beaulieu,Robyn L Ball,Francis G Blankenberg,Russell J Stewart,Derek F Amanatullah,Geoffrey M Riley,Allison Park,Jeremy Irvin,Ricky Jones ,Gary S Fanton ,Evan J Zucker

doi:10.1371/journal.pmed.1002699

Nicholas Bien, Bhavik N Patel + Show 21 more

Open Access

https://doi.org/10.1371/journal.pmed.1002699

Copy DOI

Journal: PLOS Medicine	Publication Date: Nov 27, 2018
Citations: 495	License type: CC BY 4.0

Affiliation: Stanford University

Abstract

BackgroundMagnetic resonance imaging (MRI) of the knee is the preferred method for diagnosing knee injuries. However, interpretation of knee MRI is time-intensive and subject to diagnostic error and variability. An automated system for interpreting knee MRI could prioritize high-risk patients and assist clinicians in making diagnoses. Deep learning methods, in being able to automatically learn layers of features, are well suited for modeling the complex relationships between medical images and their interpretations. In this study we developed a deep learning model for detecting general abnormalities and specific diagnoses (anterior cruciate ligament [ACL] tears and meniscal tears) on knee MRI exams. We then measured the effect of providing the model’s predictions to clinical experts during interpretation.Methods and findingsOur dataset consisted of 1,370 knee MRI exams performed at Stanford University Medical Center between January 1, 2001, and December 31, 2012 (mean age 38.0 years; 569 [41.5%] female patients). The majority vote of 3 musculoskeletal radiologists established reference standard labels on an internal validation set of 120 exams. We developed MRNet, a convolutional neural network for classifying MRI series and combined predictions from 3 series per exam using logistic regression. In detecting abnormalities, ACL tears, and meniscal tears, this model achieved area under the receiver operating characteristic curve (AUC) values of 0.937 (95% CI 0.895, 0.980), 0.965 (95% CI 0.938, 0.993), and 0.847 (95% CI 0.780, 0.914), respectively, on the internal validation set. We also obtained a public dataset of 917 exams with sagittal T1-weighted series and labels for ACL injury from Clinical Hospital Centre Rijeka, Croatia. On the external validation set of 183 exams, the MRNet trained on Stanford sagittal T2-weighted series achieved an AUC of 0.824 (95% CI 0.757, 0.892) in the detection of ACL injuries with no additional training, while an MRNet trained on the rest of the external data achieved an AUC of 0.911 (95% CI 0.864, 0.958). We additionally measured the specificity, sensitivity, and accuracy of 9 clinical experts (7 board-certified general radiologists and 2 orthopedic surgeons) on the internal validation set both with and without model assistance. Using a 2-sided Pearson’s chi-squared test with adjustment for multiple comparisons, we found no significant differences between the performance of the model and that of unassisted general radiologists in detecting abnormalities. General radiologists achieved significantly higher sensitivity in detecting ACL tears (p-value = 0.002; q-value = 0.019) and significantly higher specificity in detecting meniscal tears (p-value = 0.003; q-value = 0.019). Using a 1-tailed t test on the change in performance metrics, we found that providing model predictions significantly increased clinical experts’ specificity in identifying ACL tears (p-value < 0.001; q-value = 0.006). The primary limitations of our study include lack of surgical ground truth and the small size of the panel of clinical experts.ConclusionsOur deep learning model can rapidly generate accurate clinical pathology classifications of knee MRI exams from both internal and external datasets. Moreover, our results support the assertion that deep learning models can improve the performance of clinical experts during medical imaging interpretation. Further research is needed to validate the model prospectively and to determine its utility in the clinical setting.

Highlights

Magnetic resonance imaging (MRI) of the knee is the standard-of-care imaging modality to evaluate knee disorders, and more musculoskeletal (MSK) MRI examinations are performed on the knee than on any other region of the body [1,2,3]
The inter-rater agreement on the internal validation set among the 3 MSK radiologists, measured by the exact Fleiss kappa score, was 0.508 for detecting abnormalities, 0.800 for detecting anterior cruciate ligament (ACL) tears, and 0.745 for detecting meniscal tears
ACL tear detection, and meniscal tear detection, the model achieved area under the receiver operating characteristic curve (AUC) of 0.937, 0.965, and 0.847, respectively (Fig 5)

Summary

Introduction

Magnetic resonance imaging (MRI) of the knee is the standard-of-care imaging modality to evaluate knee disorders, and more musculoskeletal (MSK) MRI examinations are performed on the knee than on any other region of the body [1,2,3]. The negative predictive value of knee MRI is nearly 100%, so MRI serves as a noninvasive method to rule out surgical disorders such as anterior cruciate ligament (ACL) tears [11]. Due to the quantity and detail of images in each knee MRI exam, accurate interpretation of knee MRI is time-intensive and prone to interand intra-reviewer variability, even when performed by board-certified MSK radiologists [12]. An automated system for interpreting knee MRI images has a number of potential applications, such as quickly prioritizing high-risk patients in the radiologist workflow and assisting radiologists in making diagnoses [13]. In this study we developed a deep learning model for detecting general abnormalities and specific diagnoses (anterior cruciate ligament [ACL] tears and meniscal tears) on knee MRI exams. We measured the effect of providing the model’s predictions to clinical experts during interpretation

Objectives

Methods

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Deep-learning-assisted diagnosis for knee magnetic resonance imaging: Development and retrospective validation of MRNet.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: PLOS Medicine

Lead the way for us

Similar Papers

Patterns of Meniscal Damage Associated with Acute ACL Rupture
Fawzi Aljassir
Journal of Orthopedics & Rheumatology | VOL. 01
Fawzi AljassirFawzi Aljassir
01 Jan 2014
Journal of Orthopedics & Rheumatology | VOL. 01

Magnetic resonance imaging vs. arthroscopy in diagnosing anterior cruciate ligament and meniscus injuries - is there a difference
Milan Mirkovic ... Sanja Mirkovic
Srpski arhiv za celokupno lekarstvo | VOL. 150
Milan Mirkovic, et. al.Milan Mirkovic ... Sanja Mirkovic
01 Jan 2021
Srpski arhiv za celokupno lekarstvo | VOL. 150

Identification of Radiographic Parameters Associated with Anterior Cruciate Ligament Injury
Austin Looney ... Edward Chang
Arthroscopy: The Journal of Arthroscopic & Related Surgery | VOL. 37
Austin Looney, et. al.Austin Looney ... Edward Chang
01 Jan 2020
Arthroscopy: The Journal of Arthroscopic & Related Surgery | VOL. 37

Poster 272: The Significance Of Posterior Tibial Slope And Rate Of Concomitant Pathology In Pediatric Tibia Spine Avulsion And Anterior Cruciate Ligament Injuries
Shital Parikh ... Michael Wilk
Orthopaedic Journal of Sports Medicine | VOL. 11
Shital Parikh, et. al.Shital Parikh ... Michael Wilk
01 Jul 2023
Orthopaedic Journal of Sports Medicine | VOL. 11

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Deep-learning-assisted diagnosis for knee magnetic resonance imaging: Development and retrospective validation of MRNet.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: PLOS Medicine