Abstract

BackgroundLack of power and reproducibility are caveats of genetic association studies of common complex diseases. Indeed, the heterogeneity of disease etiology demands that causal models consider the simultaneous involvement of multiple genes. Rothman’s sufficient-cause model, which is well known in epidemiology, provides a framework for such a concept. In the present work, we developed a three-stage algorithm to construct gene clusters resembling Rothman’s causal model for a complex disease, starting from finding influential gene pairs followed by grouping homogeneous pairs.ResultsThe algorithm was trained and tested on 2,772 hypertensives and 6,515 normotensives extracted from four large Caucasian and Taiwanese databases. The constructed clusters, each featured by a major gene interacting with many other genes and identified a distinct group of patients, reproduced in both ethnic populations and across three genotyping platforms. We present the 14 largest gene clusters which were capable of identifying 19.3% of hypertensives in all the datasets and 41.8% if one dataset was excluded for lack of phenotype information. Although a few normotensives were also identified by the gene clusters, they usually carried less risky combinatory genotypes (insufficient causes) than the hypertensive counterparts. After establishing a cut-off percentage for risky combinatory genotypes in each gene cluster, the 14 gene clusters achieved a classification accuracy of 82.8% for all datasets and 98.9% if the information-short dataset was excluded. Furthermore, not only 10 of the 14 major genes but also many other contributing genes in the clusters are associated with either hypertension or hypertension-related diseases or functions.ConclusionsWe have shown with the constructed gene clusters that a multi-causal pie-multi-component approach can indeed improve the reproducibility of genetic markers for complex disease. In addition, our novel findings including a major gene in each cluster and sufficient risky genotypes in a cluster for disease onset (which coincides with Rothman’s sufficient cause theory) may not only provide a new research direction for complex diseases but also help to reveal the disease etiology.

Highlights

  • Lack of power and reproducibility are caveats of genetic association studies of common complex diseases

  • We focused on constructing gene clusters resembling genetic causal pies using genome-wide single-nucleotide polymorphism (SNP) data for youngonset hypertension (YOH), which has a stronger genetic attribute than its late-onset counterpart [16,17]

  • We aimed to find either single Single-nucleotide polymorphism (SNP) or multiple SNP sets each of which resembles a genetic causal pie and could distinguish a certain proportion of hypertensives (HTs) from normotensive controls (NCs)

Read more

Summary

Introduction

Lack of power and reproducibility are caveats of genetic association studies of common complex diseases. Rothman’s sufficient-cause model, which is well known in epidemiology, provides a framework for such a concept. The probability of developing a disease is increased as a person carries more and more component causes Under such a conceptual framework, if a gene is only involved in few of the causal pies which explain only a fraction of disease population, its effect toward a disease could be insignificant when all patients are considered. This model provides an explanation for low reproducibility across studies. Rothman’s causal pie model is well known in epidemiology, few attempts have been made to construct such pies, not to mention its recognition and application in the genetic field

Objectives
Methods
Results
Discussion
Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.