Ranking cancer drivers via betweenness-based outlier detection and random walks

Cesim Erten, Aissa Houdjedj, Hilal Kazan

doi:10.1186/s12859-021-03989-w

Abstract

Background: Recent cancer genomic studies have generated detailed molecular data on a large number of cancer patients. A key remaining problem in cancer genomics is the identification of driver genes.

Results: We propose BetweenNet, a computational approach that integrates genomic data with a protein-protein interaction network to identify cancer driver genes. BetweenNet uses a measure based on betweenness centrality on patient-specific networks to identify the so-called outlier genes, which correspond to dysregulated genes for each patient. After relating the mutated genes to the outliers through a bipartite graph, it runs a random-walk process on that graph, which provides the final prioritization of the mutated genes. We compare BetweenNet against state-of-the-art cancer gene prioritization methods on lung, breast, and pan-cancer datasets.

Conclusions: Our evaluations show that BetweenNet is better at recovering known cancer genes based on multiple reference databases. Additionally, we show that the GO terms and reference pathways enriched in BetweenNet-ranked genes overlap significantly with those enriched in known cancer genes, compared to the overlaps achieved by the rankings of the alternative methods.
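The outlier step described in the abstract can be illustrated with a small sketch. It is not the BetweenNet implementation: the toy network, the patient gene sets, the rule for building a patient-specific network (an induced subgraph on the genes present in that patient), and the outlier rule (Tukey's upper fence, Q3 + 1.5 IQR) are all assumptions chosen only to make the idea concrete.

```python
from collections import deque
from statistics import quantiles


def betweenness(adj):
    """Brandes' algorithm for an unweighted, undirected graph (unnormalized)."""
    bc = {v: 0.0 for v in adj}
    for s in adj:
        stack = []
        preds = {v: [] for v in adj}
        sigma = {v: 0 for v in adj}  # number of shortest paths from s
        sigma[s] = 1
        dist = {s: 0}
        queue = deque([s])
        while queue:  # breadth-first search from s
            v = queue.popleft()
            stack.append(v)
            for w in adj[v]:
                if w not in dist:
                    dist[w] = dist[v] + 1
                    queue.append(w)
                if dist[w] == dist[v] + 1:
                    sigma[w] += sigma[v]
                    preds[w].append(v)
        delta = {v: 0.0 for v in adj}
        while stack:  # accumulate dependencies in reverse BFS order
            w = stack.pop()
            for v in preds[w]:
                delta[v] += sigma[v] / sigma[w] * (1 + delta[w])
            if w != s:
                bc[w] += delta[w]
    # Each unordered pair was counted from both endpoints.
    return {v: b / 2 for v, b in bc.items()}


def induced_subgraph(adj, keep):
    """Assumed patient-specific network: keep only the genes in `keep`."""
    return {v: [w for w in adj[v] if w in keep] for v in adj if v in keep}


def upper_outliers(values):
    """Keys whose value exceeds Q3 + 1.5 * IQR (an assumed outlier rule)."""
    if len(values) < 2:
        return set()
    q1, _, q3 = quantiles(values.values(), n=4, method="inclusive")
    fence = q3 + 1.5 * (q3 - q1)
    return {k for k, v in values.items() if v > fence}


# Hypothetical 4-gene interaction network (a cycle A-H-B-C-A).
ppi = {"A": ["H", "C"], "H": ["A", "B"], "B": ["H", "C"], "C": ["A", "B"]}
# Hypothetical patients: p1..p4 have all four genes, p5 lacks gene C.
patients = {f"p{i}": {"A", "H", "B", "C"} for i in range(1, 5)}
patients["p5"] = {"A", "H", "B"}

scores = {p: betweenness(induced_subgraph(ppi, g)) for p, g in patients.items()}
outliers = set()
for gene in ppi:
    per_patient = {p: s[gene] for p, s in scores.items() if gene in s}
    outliers |= {(p, gene) for p in upper_outliers(per_patient)}

print(outliers)  # {('p5', 'H')}
```

In this toy cohort, gene H carries every shortest path between A and B only in patient p5, so its betweenness there stands out from the other patients and the pair (p5, H) is flagged.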

Highlights

Recent cancer genomic studies have generated detailed molecular data on a large number of cancer patients
One contribution of BetweenNet is the identification of patient-specific dysregulated genes with a measure based on betweenness centrality on personalized networks
A bipartite influence graph is formed to represent the relations between the mutated genes and dysregulated genes in each patient. Another contribution of BetweenNet is the use of a random-walk process on the resulting bipartite influence graph
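As a rough illustration of the last highlight, the sketch below runs a random walk with restart on a small bipartite graph linking mutated genes to outlier genes. The specific walk (restart probability 0.3, uniform start over mutated genes, unweighted edges) and the edge list are assumptions made for the example; the paper's own formulation may differ.

```python
def random_walk_with_restart(edges, restart=0.3, tol=1e-12, max_iter=10_000):
    """Stationary visit probabilities on a bipartite mutated/outlier graph.

    `edges` is a list of (mutated_gene, outlier_gene) pairs. Nodes are tagged
    with their side so a gene may appear on both sides without clashing.
    """
    adj = {}
    for m, o in edges:
        adj.setdefault(("mut", m), set()).add(("out", o))
        adj.setdefault(("out", o), set()).add(("mut", m))

    mutated = [n for n in adj if n[0] == "mut"]
    # Assumed restart distribution: uniform over the mutated genes.
    p0 = {n: (1.0 / len(mutated) if n[0] == "mut" else 0.0) for n in adj}
    p = dict(p0)

    for _ in range(max_iter):
        nxt = {n: restart * p0[n] for n in adj}
        for n, nbrs in adj.items():
            share = (1 - restart) * p[n] / len(nbrs)  # split mass evenly
            for w in nbrs:
                nxt[w] += share
        diff = sum(abs(nxt[n] - p[n]) for n in adj)
        p = nxt
        if diff < tol:
            break
    return p


# Hypothetical influence edges: (mutated gene, outlier gene).
edges = [
    ("TP53", "G1"), ("TP53", "G2"), ("TP53", "G3"),
    ("KRAS", "G1"),
    ("EGFR", "G3"),
]
probs = random_walk_with_restart(edges)

# Rank only the mutated side by its stationary probability.
ranking = sorted(
    ((gene, pr) for (side, gene), pr in probs.items() if side == "mut"),
    key=lambda item: item[1],
    reverse=True,
)
for gene, pr in ranking:
    print(f"{gene}\t{pr:.4f}")
```

In this toy graph TP53 touches every outlier gene, so it accumulates the most probability mass and is ranked first; KRAS and EGFR are symmetric and receive equal scores up to rounding.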

Summary

Introduction

Recent cancer genomic studies have generated detailed molecular data on a large number of cancer patients. [...] genes or driver modules of genes by integrating mutation data with various other types of genetic data [3,4,5,6,7,8,9,10]; see [11,12,13,14] for recent comprehensive evaluations and surveys on the topic. Rather than outputting a set of candidate driver genes or modules, a subclass of cancer driver identification methods outputs a prioritized list of genes ranked by their cancer-driving potential. Approaches in this group have used the mutation frequency of each gene, comparing it with background mutation rates [15,16,17]. A careful review of the existing cancer catalogues shows that most tumors share only a small portion of the set of all mutated genes, giving rise to the so-called tumor heterogeneity problem; methods based solely on mutation rates suffer from low sensitivity because of the long tail of infrequently mutated genes [4, 18].
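The sensitivity problem of frequency-based ranking can be seen in a simple calculation. The sketch below assumes a plain binomial model with a single, hypothetical background probability per patient; the cited methods use considerably more refined background models, so this is only meant to show why rarely mutated genes are hard to detect from frequency alone.

```python
from math import comb


def binomial_tail(k, n, p):
    """P(X >= k) for X ~ Binomial(n, p)."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))


n_patients = 100
background = 0.02  # hypothetical chance that a patient carries a passenger mutation in the gene

# A gene mutated in 12 of 100 patients versus one mutated in 3 of 100.
frequent = binomial_tail(12, n_patients, background)
rare = binomial_tail(3, n_patients, background)

print(f"mutated in 12/100 patients: p = {frequent:.2e}")
print(f"mutated in  3/100 patients: p = {rare:.2f}")
```

Under these assumed numbers, the frequently mutated gene is clearly above background, while the gene mutated in three patients is statistically indistinguishable from it, even if it is a true driver in those patients.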

Methods

Results

Discussion

Conclusion

Journal: BMC Bioinformatics
Publication Date: Feb 10, 2021
Citations: 8
License type: open-access

Similar Papers

Journeys into the genome of cancer cells
Michael R Stratton
EMBO Molecular Medicine | VOL. 5 | 22 Jan 2013

Simultaneous Integration of Multi-omics Data Improves the Identification of Cancer Driver Modules.
Dana Silverbush ... Simona Cristea
Cell Systems | VOL. 8 | 01 May 2019

Pervasive conditional selection of driver mutations and modular epistasis networks in cancer.
Jaime Iranzo ... George Gruenhagen
Cell Reports | VOL. 40 | 01 Aug 2022

FrDriver: A Functional Region Driver Identification for Protein Sequence.
Xinguo Lu ... Li Ding
IEEE/ACM Transactions on Computational Biology and Bioinformatics | VOL. 18 | 01 Sep 2020
