Stable feature selection utilizing Graph Convolutional Neural Network and Layer-wise Relevance Propagation for biomarker discovery in breast cancer

Hryhorii Chereda,Andreas Leha,Tim Beißbarth

doi:10.1016/j.artmed.2024.102840

Abstract

High-throughput technologies are becoming increasingly important in discovering prognostic biomarkers and in identifying novel drug targets. With Mammaprint, Oncotype DX, and many other prognostic molecular signatures breast cancer is one of the paradigmatic examples of the utility of high-throughput data to deliver prognostic biomarkers, that can be represented in a form of a rather short gene list. Such gene lists can be obtained as a set of features (genes) that are important for the decisions of a Machine Learning (ML) method applied to high-dimensional gene expression data. Several studies have identified predictive gene lists for patient prognosis in breast cancer, but these lists are unstable and have only a few genes in common. Instability of feature selection impedes biological interpretability: genes that are relevant for cancer pathology should be members of any predictive gene list obtained for the same clinical type of patients. Stability and interpretability of selected features can be improved by including information on molecular networks in ML methods. Graph Convolutional Neural Network (GCNN) is a contemporary deep learning approach applicable to gene expression data structured by a prior knowledge molecular network. Layer-wise Relevance Propagation (LRP) and SHapley Additive exPlanations (SHAP) are methods to explain individual decisions of deep learning models. We used both GCNN+LRP and GCNN+SHAP techniques to construct feature sets by aggregating individual explanations. We suggest a methodology to systematically and quantitatively analyze the stability, the impact on the classification performance, and the interpretability of the selected feature sets. We used this methodology to compare GCNN+LRP to GCNN+SHAP and to more classical ML-based feature selection approaches. Utilizing a large breast cancer gene expression dataset we show that, while feature selection with SHAP is useful in applications where selected features have to be impactful for classification performance, among all studied methods GCNN+LRP delivers the most stable (reproducible) and interpretable gene lists.

Full Text

Published Version

View

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Artificial intelligence in medicine	Publication Date: Mar 11, 2024
Citations: 1	License type: cc-by

R Discovery Prime

Stable feature selection utilizing Graph Convolutional Neural Network and Layer-wise Relevance Propagation for biomarker discovery in breast cancer

Abstract

Published Version

Talk to us

Similar Papers

More From: Artificial intelligence in medicine

Lead the way for us

Similar Papers

Explaining decisions of graph convolutional neural networks for analyses of molecular subnetworks in cancer
Hryhorii Chereda
-
Hryhorii CheredaHryhorii Chereda
03 May 2022
03 May 2022

Hybrid text classification model based on graph convolution network and neural network
Zhaohe Dong ... Zhengli Zhai
-
Zhaohe Dong, et. al.Zhaohe Dong ... Zhengli Zhai
01 Jun 2023
01 Jun 2023

Abstract P1-07-28: Expression of S100A14 promotes cancer cell invasion and metastasis and is associated with poor prognosis in breast cancer
Takashi Sugino ... Takuma Oishi
Cancer Research | VOL. 75
Takashi Sugino, et. al.Takashi Sugino ... Takuma Oishi
30 Apr 2015
Cancer Research | VOL. 75

Explaining a century of Swiss regional development by deep learning and SHAP values
Youxi Lai ... Kay W Axhausen
Environment and Planning B: Urban Analytics and City Science | VOL. 50
Youxi Lai, et. al.Youxi Lai ... Kay W Axhausen
13 Aug 2022
Environment and Planning B: Urban Analytics and City Science | VOL. 50

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

Stable feature selection utilizing Graph Convolutional Neural Network and Layer-wise Relevance Propagation for biomarker discovery in breast cancer

Abstract

Published Version

Talk to us

Similar Papers

More From: Artificial intelligence in medicine