DA DA: Degree-Aware Algorithms for Network-Based Disease Gene Prioritization

Sinan Erten,Mehmet Koyutürk,Rob M Ewing,Gurkan Bebek

doi:10.1186/1756-0381-4-19

Sinan Erten, Mehmet Koyutürk + Show 2 more

Open Access

https://doi.org/10.1186/1756-0381-4-19

Copy DOI

Journal: BioData Mining	Publication Date: Jun 24, 2011
Citations: 187	License type: CC BY 2.0

Affiliation: Case Western Reserve University

Abstract

BackgroundHigh-throughput molecular interaction data have been used effectively to prioritize candidate genes that are linked to a disease, based on the observation that the products of genes associated with similar diseases are likely to interact with each other heavily in a network of protein-protein interactions (PPIs). An important challenge for these applications, however, is the incomplete and noisy nature of PPI data. Information flow based methods alleviate these problems to a certain extent, by considering indirect interactions and multiplicity of paths.ResultsWe demonstrate that existing methods are likely to favor highly connected genes, making prioritization sensitive to the skewed degree distribution of PPI networks, as well as ascertainment bias in available interaction and disease association data. Motivated by this observation, we propose several statistical adjustment methods to account for the degree distribution of known disease and candidate genes, using a PPI network with associated confidence scores for interactions. We show that the proposed methods can detect loosely connected disease genes that are missed by existing approaches, however, this improvement might come at the price of more false negatives for highly connected genes. Consequently, we develop a suite called DADA, which includes different uniform prioritization methods that effectively integrate existing approaches with the proposed statistical adjustment strategies. Comprehensive experimental results on the Online Mendelian Inheritance in Man (OMIM) database show that DADA outperforms existing methods in prioritizing candidate disease genes.ConclusionsThese results demonstrate the importance of employing accurate statistical models and associated adjustment methods in network-based disease gene prioritization, as well as other network-based functional inference applications. DADA is implemented in Matlab and is freely available at http://compbio.case.edu/dada/.

Highlights

Identification of disease-associated genes is an important step toward enhancing our understanding of the cellular mechanisms that drive human diseases, with profound applications in modeling, diagnosis, prognosis, and therapeutic intervention [1]
Several algorithms have been proposed to incorporate topological properties of protein-protein interactions (PPIs) networks in understanding genetic diseases [3,8,13]. These algorithms mostly focus on prioritization of candidate genes and mainly exploit the notion that the products of genes associated with similar diseases have a higher chance of being connected in the network of PPIs
We use three reference models that take into account the degree distribution of the PPI network: (i) reference model based on degree distribution of known disease gene products, (ii) reference model based on the degree of candidate gene products, and (iii) likelihood ratio test using eigenvector centrality as the reference model

Summary

Introduction

Identification of disease-associated genes is an important step toward enhancing our understanding of the cellular mechanisms that drive human diseases, with profound applications in modeling, diagnosis, prognosis, and therapeutic intervention [1]. Several algorithms have been proposed to incorporate topological properties of PPI networks in understanding genetic diseases [3,8,13] These algorithms mostly focus on prioritization of candidate genes and mainly exploit the notion that the products of genes associated with similar diseases have a higher chance of being connected in the network of PPIs. an important challenge for these applications is the incomplete and noisy nature of the PPI data [15]. Network-based candidate disease gene prioritization There exists a wide range of disease gene prioritization methods that are based on the analysis of the topological properties of PPI networks These methods commonly rely on the observation that the products of genes that are associated with similar diseases have a higher likelihood of physically interacting [11]. Any reference to interactions between genes in this paper refers to the interactions between their products

Methods

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

DA DA: Degree-Aware Algorithms for Network-Based Disease Gene Prioritization

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BioData Mining

Lead the way for us

Similar Papers

Role of Centrality in Network-Based Prioritization of Disease Genes
Sinan Erten ... Mehmet Koyutürk
-
Sinan Erten, et. al.Sinan Erten ... Mehmet Koyutürk
01 Jan 2009
01 Jan 2009

Vavien: an algorithm for prioritizing candidate disease genes based on topological similarity of proteins in interaction networks.
Sinan Erten ... Mehmet Koyutürk
Journal of Computational Biology | VOL. 18
Sinan Erten, et. al.Sinan Erten ... Mehmet Koyutürk
28 Oct 2011
Journal of Computational Biology | VOL. 18

Disease gene prioritization by integrating tissue-specific molecular networks using a robust multi-network model.
Jingchao Ni ... Hanghang Tong
BMC Bioinformatics | VOL. 17
Jingchao Ni, et. al.Jingchao Ni ... Hanghang Tong
10 Nov 2016
BMC Bioinformatics | VOL. 17

Prioritization of candidate disease genes by enlarging the seed set and fusing information of the network topology and gene expression
Shao-Wu Zhang ... Song-Yao Zhang
Mol. BioSyst. | VOL. 10
Shao-Wu Zhang, et. al.Shao-Wu Zhang ... Song-Yao Zhang
01 Jan 2014
Mol. BioSyst. | VOL. 10

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

DA DA: Degree-Aware Algorithms for Network-Based Disease Gene Prioritization

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BioData Mining