Abstract

The correct identification of the driver genes that lead to cancer development is essential for understanding the mechanisms of cancer and developing drugs to treat it. Currently, most computational methods for identifying cancer driver genes are based on a cohort of patients. However, due to the heterogeneity of cancers, patients diagnosed with the same cancers may have different genomic characteristics and present varied clinical symptoms. It requires devising effective methods to identify personalized cancer driver genes in an individual. This work developed a novel method to predict personalized cancer driver genes of a single sample based on graph convolution networks, namely pDriverGCN. pDriverGCN constructed a mutant gene-sample heterogeneous network according to the known driver genes of samples. Then it employed two separate graph convolution network models to learn feature representations for genes and samples by gathering the features of themselves and their neighbors. Finally, pDriverGCN used the feature representations to reconstruct the association matrix between genes and samples through a linear correlation coefficient decoder. We apply our model to identify personalized driver genes of samples on the TCGA datasets. The experimental results show that our model outperforms state-of-the-art methods being evaluated at both population and individual levels.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call