Abstract

BackgroundThe majority of common diseases are multi-factorial and modified by genetically and mechanistically complex polygenic interactions and environmental factors. High-throughput genome-wide studies like linkage analysis and gene expression profiling, tend to be most useful for classification and characterization but do not provide sufficient information to identify or prioritize specific disease causal genes.ResultsExtending on an earlier hypothesis that the majority of genes that impact or cause disease share membership in any of several functional relationships we, for the first time, show the utility of mouse phenotype data in human disease gene prioritization. We study the effect of different data integration methods, and based on the validation studies, we show that our approach, ToppGene , outperforms two of the existing candidate gene prioritization methods, SUSPECTS and ENDEAVOUR.ConclusionThe incorporation of phenotype information for mouse orthologs of human genes greatly improves the human disease candidate gene analysis and prioritization.

Highlights

  • The majority of common diseases are multi-factorial and modified by genetically and mechanistically complex polygenic interactions and environmental factors

  • Mouse phenotype as a feature for candidate gene prioritization The Mammalian Phenotype (MP) Ontology enables robust annotation of mammalian phenotypes in the context of mutations, quantitative trait loci and strains that are used as models of human biology and disease

  • We demonstrate that ToppGene performs better than SUSPECTS, PROSEPCTR and ENDEAVOUR in candidate gene prioritization

Read more

Summary

Introduction

The majority of common diseases are multi-factorial and modified by genetically and mechanistically complex polygenic interactions and environmental factors. Majority of the common diseases are genetically intricate, polygenic and multifactorial, and frequently manifest as different clinical phenotypes These complex conditions are often triggered by an interaction of genetic, environmental, and physiological factors, making it difficult for researchers to narrow their focus to a single or few genes. High-throughput genome-wide studies like linkage analysis and gene expression profiling useful for classification and characterization do not provide sufficient information to identify specific disease causal genes. Both of these approaches typically result in hundreds of potential candidate genes, failing to help the researchers in reducing the target genes to a manageable number for further validation. Functional enrichment approaches [2,3,4] focusing on gene sets that share common biological function, chromo-

Objectives
Methods
Results
Discussion
Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.