Abstract

One of the core challenges in applying machine learning and artificial intelligence to medicine is the limited availability of annotated medical data. Unlike in other applications of machine learning, where an abundance of labeled data is available, the labeling and annotation of medical data and images require a major effort of manual work by expert clinicians who do not have the time to annotate manually. In this work, we propose a new deep learning technique (SLIVER-net), to predict clinical features from 3-dimensional volumes using a limited number of manually annotated examples. SLIVER-net is based on transfer learning, where we borrow information about the structure and parameters of the network from publicly available large datasets. Since public volume data are scarce, we use 2D images and account for the 3-dimensional structure using a novel deep learning method which tiles the volume scans, and then adds layers that leverage the 3D structure. In order to illustrate its utility, we apply SLIVER-net to predict risk factors for progression of age-related macular degeneration (AMD), a leading cause of blindness, from optical coherence tomography (OCT) volumes acquired from multiple sites. SLIVER-net successfully predicts these factors despite being trained with a relatively small number of annotated volumes (hundreds) and only dozens of positive training examples. Our empirical evaluation demonstrates that SLIVER-net significantly outperforms standard state-of-the-art deep learning techniques used for medical volumes, and its performance is generalizable as it was validated on an external testing set. In a direct comparison with a clinician panel, we find that SLIVER-net also outperforms junior specialists, and identifies AMD progression risk factors similarly to expert retina specialists.

Highlights

  • The application of deep learning, Convolutional Neural Networks (CNNs), has proven to be successful for detecting and predicting disease from medical image data[1,2,3,4,5]

  • We found that SLIVER-net outperforms a standard 3D CNN in the setting of a relatively small sample size

  • We developed a new deep learning technique, SLIVER-net, to predict clinical features from optical coherence tomography (OCT) volumes

Read more

Summary

INTRODUCTION

The application of deep learning, Convolutional Neural Networks (CNNs), has proven to be successful for detecting and predicting disease from medical image data[1,2,3,4,5]. A key component of SLIVER-net was flattening the OCT volume into an image by stacking the different slices into one long image (see “Methods”) This allowed us to incorporate a large publicly available dataset[4] using transfer learning, which is commonly used to address prediction problems when the amount of training data is small[17]. SLIVER-net was pre-trained on the OCT dataset collected by Kermany et al.[38] This data consisted of 84,495 2D horizontal OCT B-scan images (e.g., slices) passing through the fovea but were labeled with other ocular diseases (Choroidal neovascularization (CNV), diabetic macular edema (DME), and Drusen). Resolution did not significantly affect the model’s performance for biomarker prediction (Fig. 9) In both scenarios, we have observed that SLIVER-net was robust to different sizes and resolutions of OCT scans, making it useful in various clinical scenarios and under different resource constraints

DISCUSSION
METHODS
Findings
CODE AVAILABILITY
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call