Biological interpretation of deep neural network for phenotype prediction based on gene expression

Blaise Hanczar, Mathieu Arles, Tina Issa, Farida Zehraoui

doi:10.1186/s12859-020-03836-4

Abstract

Background: The use of predictive gene signatures to assist clinical decision-making is becoming increasingly important. Deep learning has great potential for predicting phenotype from gene expression profiles. However, neural networks are viewed as black boxes that provide accurate predictions without any explanation. The demand for these models to be interpretable is growing, especially in the medical field.

Results: We focus on explaining the predictions of a deep neural network model built from gene expression data. The most important neurons and genes influencing the predictions are identified and linked to biological knowledge. Our experiments on cancer prediction show that: (1) the deep learning approach outperforms classical machine learning methods on large training sets; (2) our approach produces interpretations more coherent with biology than state-of-the-art approaches; (3) we can provide a comprehensive explanation of the predictions for biologists and physicians.

Conclusion: We propose an original approach for the biological interpretation of deep learning models for phenotype prediction from gene expression data. Since the model can find relationships between the phenotype and gene expression, we may assume that there is a link between the identified genes and the phenotype. The interpretation can, therefore, lead to new biological hypotheses to be investigated by biologists.

Highlights

The use of predictive gene signatures to assist clinical decision-making is becoming increasingly important
In this paper, we propose an original approach for biological interpretation of deep learning models for phenotype prediction from gene expression data
The most important neurons are associated with a list of genes and the corresponding biological knowledge (Gene Ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG), and Disease Ontology Lite (DOLite) annotations)

Summary

Introduction

The use of predictive gene signatures to assist clinical decision-making is becoming increasingly important. Deep learning has great potential for predicting phenotype from gene expression profiles. Neural networks are viewed as black boxes that provide accurate predictions without any explanation. The demand for these models to be interpretable is growing, especially in the medical field. The use of classifiers constructed from gene expression profiles to assist decision-making in clinical research is likewise becoming more important. Machine learning methods, including support vector machines, random forests, and boosting, are among the main tools used to make biological discoveries from the huge amount of available gene expression data [1].

Hanczar et al. BMC Bioinformatics (2020) 21:501

Journal: BMC Bioinformatics
Publication Date: Nov 4, 2020
Citations: 19
License type: open-access
