To date, only a few small studies have attempted deep learning-based automatic segmentation of white matter hyperintensity (WMH) lesions in patients with cerebral infarction, which is complicated because stroke-related lesions can obscure WMH borders. We developed and validated deep learning algorithms to segment WMH lesions accurately in patients with cerebral infarction, using multisite datasets involving 8,421 patients with acute ischemic stroke. We included 8,421 stroke patients from 9 centers in Korea. 2D UNet and SE-Unet models were trained using 2,408 FLAIR MRI from 3 hospitals and validated using 6,013 FLAIR MRIs from 6 hospitals. WMH segmentation performance was assessed by calculating DSC, correlation coefficient, and concordance correlation coefficient compared to a human-segmented gold standard. In addition, we obtained an uncertainty index that represents overall ambiguity in the voxel classification for WMH segmentation in each patient based on the Kullback-Leibler divergence. In the training dataset, the mean age was 67.4±13.0 years and 60.4% were men. The mean (95% CI) DSCs for Unet in internal testing and external validation were respectively 0.659 (0.649-0.669) and 0.710 (0.707-0.714), which were slightly lower than the reliability between humans (DSC=0.744; 95% CI=0.738-0.751; P=.031). Compared with the Unet, the SE-Unet demonstrated better performance, achieving a mean DSC of 0.675 (0.666-0.685; P<.001) in the internal testing and 0.722 (0.719-0.726; P<.001) in the external validation; moreover, it achieved high DSC values (ranging from 0.672 to 0.744) across multiple validation datasets. We observed a significant correlation between WMH volumes that were segmented automatically and manually for the Unet (r=0.917, P<.0001) and even stronger for the SE-Unet (r=0.933, P<.0001). The SE-Unet also attained a high concordance correlation coefficient (ranging from 0.841 to 0.956) in external test datasets. In addition, the uncertainty indices in the majority of patients (86%) in the external datasets were below 0.35, with an average DSC of 0.744 in these patients. We developed and validated deep learning algorithms to segment WMH in patients with acute cerebral infarction using the largest-ever MRI datasets. In addition, we showed that the uncertainty index can be used to identify cases where automatic WMH segmentation is less accurate and requires human review. WMH = white matter hyperintensity; CNN = convolutional neural networks; SE = squeeze-and-excitation; KL = Kullback-Leibler; ReLU = rectified linear unit; LKW = last known well; mRS = modified Rankin Scale; NIHSS = National Institute of Health Stroke Scale; LAA = large artery atherosclerosis; SVO = small vessel occlusion; CE = cardioembolism.
Read full abstract