PAN: Personalized Annotation-Based Networks for the Prediction of Breast Cancer Relapse.

Thin Nguyen,Xiaomei Li,Truyen Tran,Buu Truong,Svetha Venkatesh,Thomas P Quinn,Thuc Duy Le,Samuel C Lee

doi:10.1109/tcbb.2021.3076422

Abstract

The classification of clinical samples based on gene expression data is an important part of precision medicine. In this manuscript, we show how transforming gene expression data into a set of personalized (sample-specific) networks can allow us to harness existing graph-based methods to improve classifier performance. Existing approaches to personalized gene networks have the limitation that they depend on other samples in the data and must get re-computed whenever a new sample is introduced. Here, we propose a novel method, called Personalized Annotation-based Networks (PAN), that avoids this limitation by using curated annotation databases to transform gene expression data into a graph. Unlike competing methods, PANs are calculated for each sample independent of the population, making it a more efficient way to obtain single-sample networks. Using three breast cancer datasets as a case study, we show that PAN classifiers not only predict cancer relapse better than gene features alone, but also outperform PPI (protein-protein interactions) and population-level graph-based classifiers. This work demonstrates the practical advantages of graph-based classification for high-dimensional genomic data, while offering a new approach to making sample-specific networks. Supplementary information: PAN and the baselines are implemented in Python. Source code and data are available at https://github.com/thinng/PAN.

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE/ACM Transactions on Computational Biology and Bioinformatics	Publication Date: Apr 28, 2021
Citations: 7	License type: publisher-specific-oa

R Discovery Prime

R Discovery Prime

PAN: Personalized Annotation-Based Networks for the Prediction of Breast Cancer Relapse.

Abstract

Talk to us

Similar Papers

More From: IEEE/ACM Transactions on Computational Biology and Bioinformatics

Lead the way for us

Similar Papers

The effect of principal component analysis on machine learning accuracy with high-dimensional spectral data
Tom Howley ... Alan G Ryder
Knowledge-Based Systems | VOL. 19
Tom Howley, et. al.Tom Howley ... Alan G Ryder
08 Feb 2006
Knowledge-Based Systems | VOL. 19

Improving the classification of high dimensional class-imbalanced data using the Chaos particle swarm optimization with Levy Flight
Mohammad Ali Zarif ... Javad Hamidzadeh
-
Mohammad Ali Zarif, et. al.Mohammad Ali Zarif ... Javad Hamidzadeh
28 Oct 2021
28 Oct 2021

Naive Bayes combined with partial least squares for classification of high dimensional microarray data
Tahir Mehmood ... Muhammad Moeen Butt
Chemometrics and Intelligent Laboratory Systems | VOL. 222
Tahir Mehmood, et. al.Tahir Mehmood ... Muhammad Moeen Butt
13 Jan 2022
Chemometrics and Intelligent Laboratory Systems | VOL. 222

The Effect of Principal Component Analysis on Machine Learning Accuracy with High Dimensional Spectral Data
Tom Howley ... Michael G Madden
-
Tom Howley, et. al.Tom Howley ... Michael G Madden
12 Dec 2005
12 Dec 2005

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

PAN: Personalized Annotation-Based Networks for the Prediction of Breast Cancer Relapse.

Abstract

Talk to us

Similar Papers

More From: IEEE/ACM Transactions on Computational Biology and Bioinformatics