Abstract

In the wake of the use of deep learning algorithms in medical image analysis, we compared performance of deep learning algorithms, namely the deep Boltzmann machine (DBM), convolutional encoder network (CEN) and patch-wise convolutional neural network (patch-CNN), with two conventional machine learning schemes: Support vector machine (SVM) and random forest (RF), for white matter hyperintensities (WMH) segmentation on brain MRI with mild or no vascular pathology. We also compared all these approaches with a method in the Lesion Segmentation Tool public toolbox named lesion growth algorithm (LGA). We used a dataset comprised of 60 MRI data from 20 subjects in the Alzheimer’s Disease Neuroimaging Initiative (ADNI) database, each scanned once every year during three consecutive years. Spatial agreement score, receiver operating characteristic and precision-recall performance curves, volume disagreement score, agreement with intra-/inter-observer reliability measurements and visual evaluation were used to find the best configuration of each learning algorithm for WMH segmentation. By using optimum threshold values for the probabilistic output from each algorithm to produce binary masks of WMH, we found that SVM and RF produced good results for medium to very large WMH burden but deep learning algorithms performed generally better than conventional ones in most evaluations.

Highlights

  • White matter hyperintensities (WMH) are brain regions that exhibit intensity levels higher than those of normal tissues on T2-weighted magnetic resonance images (MRI)

  • We investigate the accuracy of conventional machine learning algorithms and deep learning algorithms for white matter hyperintensities (WMH) segmentation through the comparison of each algorithm’s performance in multiple types of evaluation

  • The set of features is an important aspect that decides the performance of the learning algorithm

Read more

Summary

Introduction

White matter hyperintensities (WMH) are brain regions that exhibit intensity levels higher than those of normal tissues on T2-weighted magnetic resonance images (MRI). The machine learning algorithms evaluated in this study are SVM, RF, DBM, CEN and patch-CNN Their evaluation comprises spatial agreement score, receiver operating characteristic (ROC) and precision-recall (PR) performance curves and volumetric disagreement with intra-/inter-observer reliability measurements. 7/20 subjects (i.e., 7 subjects × 3 consecutive years × 2 measurements = 42 measurements in total), blind to the ground truth measurements and to previous assessments, for evaluating intra-observer reliability These were done semi-automatically using Mango [15], individually thresholding each WMH 3D cluster in the original FLAIR images. The data preprocessing steps comprise co-registration of the MRI sequences on each scanning session, skull stripping and intracranial volume mask generation, cortical grey matter, cerebrospinal fluid and brain ventricle extraction and intensity value normalisation.

Learning Algorithms and Their Configuration for Experiment
Lesion Growth Algorithm of Lesion Segmentation Tool
Support Vector Machine and Random Forest
Deep Boltzmann Machine
Convolutional Encoder Network
Patch-Wise Convolutional Neural Network
Experiments and Results
Conventional Machine Learning Experiment and Result
WMH Volume-Based Evaluation
Non-Parametric Correlation with Fazekas’s and Longstreth’s Visual Ratings
Visual Evaluation
Running Time Evaluation
Conclusions
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call