Abstract

BackgroundComplex diseases are characterized as being polygenic and multifactorial, so this poses a challenge regarding the search for genes related to them. With the advent of high-throughput technologies for genome sequencing, gene expression measurements (transcriptome), and protein-protein interactions, complex diseases have been sistematically investigated. Particularly, Protein-Protein Interaction (PPI) networks have been used to prioritize genes related to complex diseases according to its topological features. However, PPI networks are affected by ascertainment bias, in which more studied proteins tend to have more connections, degrading the results quality. Additionally, methods using only PPI networks can provide only static and non-specific results, since the topologies of these networks are not specific of a given disease.ResultsThe goal of this work is to develop a methodology that integrates PPI networks with disease specific data sources, such as GWAS and gene expression, to find genes more specific of a given complex disease. After the integration of PPI networks and gene expression data, the resulting network is used to connect genes related to the disease through the shortest paths that have the greatest concordance between their gene expressions. Both case and control expression data are used separately and, at the end, the most altered genes between the two conditions are selected. To evaluate the method, schizophrenia was adopted as case study.ConclusionResults show that the proposed method successfully retrieves differentially coexpressed genes in two conditions, while avoiding the bias from literature. Moreover we were able to achieve a greater concordance in the selection of important genes from different microarray studies of the same disease and to produce a more specific gene set related to the studied disease.

Highlights

  • Complex diseases are characterized as being polygenic and multifactorial, so this poses a challenge regarding the search for genes related to them

  • Prioritization of relevant genes associated with complex diseases is a significant challenge, because such diseases are polygenic and multifactorial

  • The proposed method could obtain a greater replicability in the selection of important genes from different microarray studies of the same disease and to produce a more specific gene set related to the studied disease

Read more

Summary

Introduction

Complex diseases are characterized as being polygenic and multifactorial, so this poses a challenge regarding the search for genes related to them. With the advent of high-throughput technologies for genome sequencing, gene expression measurements (transcriptome), and protein-protein interactions, complex diseases have been sistematically investigated. Protein-Protein Interaction (PPI) networks have been used to prioritize genes related to complex diseases according to its topological features. PPI networks are affected by ascertainment bias, in which more studied proteins tend to have more connections, degrading the results quality. Methods using only PPI networks can provide only static and non-specific results, since the topologies of these networks are not specific of a given disease

Objectives
Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call