Abstract

In solving the gene prioritization problem, ranking candidate genes from most to least promising is attempted before further experimental validation. Integrating the results of various data sources and methods tends to result in a better performance when solving the gene prioritization problem. Therefore, a wide range of datasets and algorithms was investigated; these included topological features of protein networks, physicochemical characteristics and blast similarity scores of protein sequences, gene ontology, biological pathways, and tissue-based data sources. The novelty of this study lies in how the best-performing methods and reliable multi-genomic data sources were applied in an efficient two-step approach. In the first step, various multi-genomic data sources and algorithms were evaluated and seven best-performing rankers were then applied to prioritize candidate genes in different ways. In the second step, global prioritization was obtained by aggregating several scoring schemes.The results showed that protein networks, functional linkage networks, gene ontology, and biological pathway data sources have a significant impact on the quality of the gene prioritization approach. The findings also demonstrated a direct relationship between the degree of genes and the ranking quality of the evaluated tools. This approach outperformed previously published algorithms (e.g., DIR, GPEC, GeneDistiller, and Endeavour) in all evaluation metrices and led to the development of GPS software. Its user-friendly interface and accuracy makes GPS a powerful tool for the identification of human disease genes. GPS is available at http://gpsranker.com and http://LBB.ut.ac.ir.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.