Abstract
Mutations in the isocitrate dehydrogenase genes IDH1 and IDH2 are frequently found in diffuse and anaplastic astrocytic and oligodendroglial tumours as well as in secondary glioblastomas. As IDH is an important prognostic, diagnostic and therapeutic biomarker for glioma, determining its mutational status is of paramount importance. Haematoxylin and eosin (H&E) staining is a valuable tool in precision oncology, as it guides histopathology-based diagnosis and subsequent patient treatment. However, H&E staining alone does not determine the IDH mutational status of a tumour. Deep learning methods applied to MRI data have been demonstrated to be a useful tool in IDH status prediction; however, the effectiveness of deep learning on H&E slides in the clinical setting has not yet been investigated. Furthermore, the performance of deep learning methods in medical imaging has in practice been limited by the small sample sizes currently available. Here we propose a data augmentation method based on Generative Adversarial Networks (GANs) to improve the prediction performance of IDH mutational status using H&E slides. The H&E slides were acquired from 266 grade II-IV glioma patients from a mixture of public and private databases, including 130 IDH-wildtype and 136 IDH-mutant patients. A baseline deep learning model without data augmentation achieved an accuracy of 0.794 (AUC = 0.920). With GAN-based data augmentation, the accuracy of IDH mutational status prediction improved to 0.853 (AUC = 0.927) when 3,000 GAN-generated training samples were added to the original training set of 24,000 samples. By additionally integrating patient age into the model, the accuracy improved further to 0.882 (AUC = 0.931). Our findings show that deep learning methodology, enhanced by GAN data augmentation, can support physicians in predicting the IDH status of gliomas.
Highlights
The haematoxylin and eosin (H&E) stain in histopathology is a valuable tool for precision oncology and is used in assisting the diagnosis of glioma and other tumours
200 patients were randomly selected from the glioma cohorts of The Cancer Genome Atlas (TCGA)[32], and another cohort of 66 patients was recruited at a local hospital
The baseline deep learning model based on ResNet[50] without data augmentation (24,000 image samples) achieved an accuracy of 0.765 (AUC = 0.823) on the validation set and an accuracy of 0.794 (AUC = 0.920) on the test set. These results demonstrate that a convolutional neural network (CNN) can effectively identify the IDH mutational status
Summary
The haematoxylin and eosin (H&E) stain in histopathology is a valuable tool for precision oncology and is used in assisting the diagnosis of glioma and other tumours. The effectiveness of deep learning in classification and mutation prediction from H&E slides has recently been explored for non-small cell lung cancer[25] and in virtual histological staining of unlabelled tissue images[26]. We propose a deep learning-based model for histopathological image classification. This model is enhanced by a data augmentation method based on Generative Adversarial Networks (GANs)[30]. It has been demonstrated that GAN augmentation can effectively improve the performance of deep learning models in brain lesion segmentation[31]. We first demonstrate that deep learning is a useful and accurate tool for differentiating IDH-mutant from IDH-wildtype gliomas based on histopathology images. We then demonstrate that GAN-based data augmentation may further assist in histopathology image classification.
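The augmentation strategy described above, adding GAN-generated tiles to the real training set before training the classifier, can be sketched as follows. This is a minimal illustrative sketch, not the authors' implementation: the function name, array shapes, and toy sample counts are assumptions, and the real pipeline would draw the synthetic tiles from a trained GAN generator rather than from placeholder arrays.

```python
import numpy as np

def augment_with_gan_samples(real_x, real_y, gan_x, gan_y, seed=0):
    """Combine real training tiles with GAN-generated tiles and shuffle.

    real_x / gan_x: image tiles, shape (n, H, W, C)
    real_y / gan_y: binary IDH labels (0 = wildtype, 1 = mutant)
    """
    x = np.concatenate([real_x, gan_x], axis=0)
    y = np.concatenate([real_y, gan_y], axis=0)
    # Shuffle so real and synthetic tiles are mixed within each batch
    order = np.random.default_rng(seed).permutation(len(x))
    return x[order], y[order]

# Toy stand-ins for the 24,000 real and 3,000 GAN-generated tiles
real_x = np.zeros((240, 64, 64, 3), dtype=np.float32)
real_y = np.zeros(240, dtype=np.int64)
gan_x = np.ones((30, 64, 64, 3), dtype=np.float32)
gan_y = np.ones(30, dtype=np.int64)

aug_x, aug_y = augment_with_gan_samples(real_x, real_y, gan_x, gan_y)
print(aug_x.shape)  # (270, 64, 64, 3)
```

In the paper's setting, the two pools would be 24,000 real tiles and 3,000 synthetic ones; the combined, shuffled set is then fed to the CNN classifier as ordinary training data.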