Prioritizing Disease Genes by Using Search Engine Algorithm

Min Li, Zhuohua Zhang, Ruiqing Zheng, Jianxin Wang, Qi Li, Fang-Xiang Wu

doi:10.2174/1574893611666160125220905

Abstract

Identifying disease genes from a large number of candidates for a specific disease is a fundamental challenge. Because experiment-based methods are generally time-consuming and laborious, computational approaches have become a new strategy for identifying disease candidates. In this paper, we proposed PDGTR, an algorithm based on a search engine ranking method, to prioritize disease candidates. First, we constructed a weighted human disease network by calculating the topological similarity and phenotype similarity of each pair of diseases. Then, we calculated the similarities of all genes by using the protein-protein interaction network and the edge clustering coefficient. For a specific disease, a logistic regression model was used to generate the prior knowledge of each gene. Finally, PDGTR was applied to prioritize the disease candidates. PDGTR was tested on five diseases (breast cancer, colorectal cancer, hepatocellular carcinoma, gastric cancer, and osteoporosis) and compared with four state-of-the-art algorithms: RWR, DADA, PRINCE, and PRP. Experimental results based on leave-one-out cross-validation, precision, ROC curves, and enrichment show that PDGTR outperforms RWR, DADA, PRINCE, and PRP. Moreover, some of the potential disease genes predicted by PDGTR have already been reported in the literature.

Keywords: systems biology, protein-protein interaction network, disease gene, search engine algorithm, random walk, disease similarity.
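The abstract does not give PDGTR's update rule, so the following is only a minimal sketch of the general idea it names: a PageRank-style ranking over a protein-protein interaction network, with edges weighted by the edge clustering coefficient and a per-gene prior used as the restart distribution. Everything specific here is an assumption made for illustration: the ECC formula (triangles through an edge divided by min(d_u - 1, d_v - 1), a definition commonly used for PPI networks), the small `eps` added to each edge weight, the `damping` value, the toy gene names, and the hand-set prior, which stands in for the output of the logistic regression model.

```python
from collections import defaultdict


def build_graph(edges):
    """Build an undirected adjacency map from (u, v) interaction pairs."""
    adj = defaultdict(set)
    for u, v in edges:
        adj[u].add(v)
        adj[v].add(u)
    return dict(adj)


def edge_clustering_coefficient(adj, u, v):
    """ECC(u, v) = triangles through (u, v) / min(d_u - 1, d_v - 1).

    Assumed definition; returns 0.0 when either endpoint has degree 1.
    """
    triangles = len(adj[u] & adj[v])
    denom = min(len(adj[u]) - 1, len(adj[v]) - 1)
    return triangles / denom if denom > 0 else 0.0


def rank_genes(adj, prior, damping=0.85, eps=1e-3, tol=1e-10, max_iter=1000):
    """Personalized PageRank over ECC-weighted edges (illustrative only).

    prior maps gene -> non-negative prior score; missing genes get 0.
    eps keeps zero-ECC edges walkable so no node becomes a dead end.
    """
    genes = sorted(adj)
    total = sum(prior.get(g, 0.0) for g in genes)
    if total <= 0:
        raise ValueError("prior must give positive mass to at least one gene")
    restart = {g: prior.get(g, 0.0) / total for g in genes}

    weight = {
        u: {v: edge_clustering_coefficient(adj, u, v) + eps for v in adj[u]}
        for u in genes
    }
    out_sum = {u: sum(weight[u].values()) for u in genes}

    score = dict(restart)
    for _ in range(max_iter):
        new = {g: (1.0 - damping) * restart[g] for g in genes}
        for u in genes:
            for v, w in weight[u].items():
                new[v] += damping * score[u] * w / out_sum[u]
        delta = sum(abs(new[g] - score[g]) for g in genes)
        score = new
        if delta < tol:
            break
    return score


if __name__ == "__main__":
    # Toy network with hypothetical gene names; "A" is the only known seed.
    toy = build_graph([("A", "B"), ("B", "C"), ("A", "C"), ("C", "D")])
    scores = rank_genes(toy, prior={"A": 1.0})
    for gene in sorted(scores, key=scores.get, reverse=True):
        print(gene, round(scores[gene], 4))
```

In this sketch, candidates are ranked by their stationary score: genes that sit in densely connected neighborhoods of high-prior genes rise to the top, while a gene attached by a single triangle-free edge (such as `D` in the toy network) receives very little.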


Journal: Current Bioinformatics
Publication Date: Apr 1, 2016
Citations: 21

Similar Papers

A algorithm for identifying disease genes by incorporating the subcellular localization information into the protein-protein interaction networks
Xiwei Tang ... Yuan Sun
01 Dec 2016

Inferring Gene-Phenotype Associations via Global Protein Complex Network Propagation
Peng Yang ... Xiaoli Li
PLoS ONE | VOL. 6
25 Jul 2011

Identification of Human Disease Genes from Interactome Network Using Graphlet Interaction
Xiao-Dong Wang ... Dong-Qing Wei
PLoS ONE | VOL. 9
22 Jan 2014

Prioritizing Disease Genes by Bi-Random Walk
Maoqiang Xie ... Taehyun Hwang
01 Jan 2012
