Disease Gene Prioritization Research Articles

BackgroundIdentification of driver genes related to certain types of cancer is an important research topic. Several systems biology approaches have been suggested, in particular for the identification of breast cancer (BRCA) related genes. Such approaches usually rely on differential gene expression and/or mutational landscape data. In some cases interaction network data is also integrated to identify cancer-related modules computationally.ResultsWe provide a framework for the comparative graph-theoretical analysis of networks integrating the relevant gene expression, mutations, and potein-protein interaction network data. The comparisons involve a graph-theoretical analysis of normal and tumor network pairs across all instances of a given set of breast cancer samples. The network measures under consideration are based on appropriate formulations of various centrality measures: betweenness, clustering coefficients, degree centrality, random walk distances, graph-theoretical distances, and Jaccard index centrality.ConclusionsAmong all the studied centrality-based graph-theoretical properties, we show that a betweenness-based measure differentiates BRCA genes across all normal versus tumor network pairs, than the rest of the popular centrality-based measures. The AUROC and AUPR values of the gene lists ordered with respect to the measures under study as compared to NCBI BioSystems pathway and the COSMIC database of cancer genes are the largest with the betweenness-based differentiation, followed by the measure based on degree centrality. In order to test the robustness of the suggested measures in prioritizing cancer genes, we further tested the two most promising measures, those based on betweenness and degree centralities, on randomly rewired networks. We show that both measures are quite resilient to noise in the input interaction network. We also compared the same measures against a state-of-the-art alternative disease gene prioritization method, MUFFFINN. We show that both our graph-theoretical measures outperform MUFFINN prioritizations in terms of ROC and precions/recall analysis. Finally, we filter the ordered list of the best measure, the betweenness-based differentiation, via a maximum-weight independent set formulation and investigate the top 50 genes in regards to literature verification. We show that almost all genes in the list are verified by the breast cancer literature and three genes are presented as novel genes that may potentialy be BRCA-related but missing in literature.

BackgroundAccurately prioritizing candidate disease genes is an important and challenging problem. Various network-based methods have been developed to predict potential disease genes by utilizing the disease similarity network and molecular networks such as protein interaction or gene co-expression networks. Although successful, a common limitation of the existing methods is that they assume all diseases share the same molecular network and a single generic molecular network is used to predict candidate genes for all diseases. However, different diseases tend to manifest in different tissues, and the molecular networks in different tissues are usually different. An ideal method should be able to incorporate tissue-specific molecular networks for different diseases.ResultsIn this paper, we develop a robust and flexible method to integrate tissue-specific molecular networks for disease gene prioritization. Our method allows each disease to have its own tissue-specific network(s). We formulate the problem of candidate gene prioritization as an optimization problem based on network propagation. When there are multiple tissue-specific networks available for a disease, our method can automatically infer the relative importance of each tissue-specific network. Thus it is robust to the noisy and incomplete network data. To solve the optimization problem, we develop fast algorithms which have linear time complexities in the number of nodes in the molecular networks. We also provide rigorous theoretical foundations for our algorithms in terms of their optimality and convergence properties. Extensive experimental results show that our method can significantly improve the accuracy of candidate gene prioritization compared with the state-of-the-art methods.ConclusionsIn our experiments, we compare our methods with 7 popular network-based disease gene prioritization algorithms on diseases from Online Mendelian Inheritance in Man (OMIM) database. The experimental results demonstrate that our methods recover true associations more accurately than other methods in terms of AUC values, and the performance differences are significant (with paired t-test p-values less than 0.05). This validates the importance to integrate tissue-specific molecular networks for studying disease gene prioritization and show the superiority of our network models and ranking algorithms toward this purpose. The source code and datasets are available at http://nijingchao.github.io/CRstar/.Electronic supplementary materialThe online version of this article (doi:10.1186/s12859-016-1317-x) contains supplementary material, which is available to authorized users.

Disease Gene Prioritization Research Articles

Related Topics

Articles published on Disease Gene Prioritization

Improved ontology-based similarity calculations using a study-wise annotation model.

Integrating phenotype ontologies with PhenomeNET

Graph-theoretical comparison of normal and tumor networks in identifying BRCA genes

Effect of Aggregation Operators on Network-Based Disease Gene Prioritization: A Case Study on Blood Disorders.

S-FLN: A sequence-based hierarchical approach for functional linkage network construction

Network propagation in the cytoscape cyberinfrastructure

Arete \u2013 candidate gene prioritization using biological network topology with additional evidence types

Construction of reliable heterogeneous network using protein sequence similarity for the prioritization of candidate disease genes

Loss of Conservation of Graph Centralities in Reverse-engineered Transcriptional Regulatory Networks

GenePANDA\u2014a novel network-based gene prioritizing tool for complex diseases

PERCH: A Unified Framework for Disease Gene Prioritization.

Linearity of network proximity measures: implications for set-based queries and significance testing

A Gene Module-Based eQTL Analysis Prioritizing Disease Genes and Pathways in Kidney Cancer

Gene co-opening network deciphers gene functional relationships.

An improved method for functional similarity analysis of genes based on Gene Ontology.

Integrated Post-GWAS Analysis Sheds New Light on the Disease Mechanisms of Schizophrenia.

Disease gene prioritization by integrating tissue-specific molecular networks using a robust multi-network model.

Integrating phenotypic features and tissue-specific information to prioritize disease genes

Prioritizing Disease Genes by Using Search Engine Algorithm

Efficient and biologically relevant consensus strategy for Parkinson's disease gene prioritization.

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Disease Gene Prioritization Research Articles

Related Topics

Articles published on Disease Gene Prioritization

Improved ontology-based similarity calculations using a study-wise annotation model.

Integrating phenotype ontologies with PhenomeNET

Graph-theoretical comparison of normal and tumor networks in identifying BRCA genes

Effect of Aggregation Operators on Network-Based Disease Gene Prioritization: A Case Study on Blood Disorders.

S-FLN: A sequence-based hierarchical approach for functional linkage network construction

Network propagation in the cytoscape cyberinfrastructure

Arete \u2013 candidate gene prioritization using biological network topology with additional evidence types

Construction of reliable heterogeneous network using protein sequence similarity for the prioritization of candidate disease genes

Loss of Conservation of Graph Centralities in Reverse-engineered Transcriptional Regulatory Networks

GenePANDA\u2014a novel network-based gene prioritizing tool for complex diseases

PERCH: A Unified Framework for Disease Gene Prioritization.

Linearity of network proximity measures: implications for set-based queries and significance testing

A Gene Module-Based eQTL Analysis Prioritizing Disease Genes and Pathways in Kidney Cancer

Gene co-opening network deciphers gene functional relationships.

An improved method for functional similarity analysis of genes based on Gene Ontology.

Integrated Post-GWAS Analysis Sheds New Light on the Disease Mechanisms of Schizophrenia.

Disease gene prioritization by integrating tissue-specific molecular networks using a robust multi-network model.

Integrating phenotypic features and tissue-specific information to prioritize disease genes

Prioritizing Disease Genes by Using Search Engine Algorithm

Efficient and biologically relevant consensus strategy for Parkinson's disease gene prioritization.