Causal graph-based analysis of genome-wide association data in rheumatoid arthritis

Alexander V Alekseyenko,Leonid Padyukov,Nikita I Lytkin,Bo Ding,Alexander Statnikov,Jizhou Ai,Constantin F Aliferis

doi:10.1186/1745-6150-6-25

Abstract

BackgroundGWAS owe their popularity to the expectation that they will make a major impact on diagnosis, prognosis and management of disease by uncovering genetics underlying clinical phenotypes. The dominant paradigm in GWAS data analysis so far consists of extensive reliance on methods that emphasize contribution of individual SNPs to statistical association with phenotypes. Multivariate methods, however, can extract more information by considering associations of multiple SNPs simultaneously. Recent advances in other genomics domains pinpoint multivariate causal graph-based inference as a promising principled analysis framework for high-throughput data. Designed to discover biomarkers in the local causal pathway of the phenotype, these methods lead to accurate and highly parsimonious multivariate predictive models. In this paper, we investigate the applicability of causal graph-based method TIE* to analysis of GWAS data. To test the utility of TIE*, we focus on anti-CCP positive rheumatoid arthritis (RA) GWAS datasets, where there is a general consensus in the community about the major genetic determinants of the disease.ResultsApplication of TIE* to the North American Rheumatoid Arthritis Cohort (NARAC) GWAS data results in six SNPs, mostly from the MHC locus. Using these SNPs we develop two predictive models that can classify cases and disease-free controls with an accuracy of 0.81 area under the ROC curve, as verified in independent testing data from the same cohort. The predictive performance of these models generalizes reasonably well to Swedish subjects from the closely related but not identical Epidemiological Investigation of Rheumatoid Arthritis (EIRA) cohort with 0.71-0.78 area under the ROC curve. Moreover, the SNPs identified by the TIE* method render many other previously known SNP associations conditionally independent of the phenotype.ConclusionsOur experiments demonstrate that application of TIE* captures maximum amount of genetic information about RA in the data and recapitulates the major consensus findings about the genetic factors of this disease. In addition, TIE* yields reproducible markers and signatures of RA. This suggests that principled multivariate causal and predictive framework for GWAS analysis empowers the community with a new tool for high-quality and more efficient discovery.ReviewersThis article was reviewed by Prof. Anthony Almudevar, Dr. Eugene V. Koonin, and Prof. Marianthi Markatou.

Highlights

Genome-wide association studies (GWAS) owe their popularity to the expectation that they will make a major impact on diagnosis, prognosis and management of disease by uncovering genetics underlying clinical phenotypes
In the remainder of this paper, we describe the application of a state-of-the-art causal graph-based algorithm TIE* to anti-CCP positive rheumatoid arthritis GWAS data [12]
Patterns of missing data in North American Rheumatoid Arthritis Cohort (NARAC) cohort are not random with respect to the phenotype On our initial examination of NARAC GWAS data we have noticed that, even after applying data completeness filtering, many SNPs still contain a large number of missing genotype calls

Summary

Introduction

GWAS owe their popularity to the expectation that they will make a major impact on diagnosis, prognosis and management of disease by uncovering genetics underlying clinical phenotypes. The dominant paradigm in GWAS data analysis so far consists of extensive reliance on methods that emphasize contribution of individual SNPs to statistical association with phenotypes. Genome-wide association studies (GWAS) are considered to be one of the primary tools for determining genetic links to disease. GWAS have been abundant in recent scientific research with more than 900 primary large-scale studies performed in the last 5 years. Each of these studies has genotyped at least 100,000 single. Multivariate methods take a step forward by shifting the focus on how combining the individual SNP signals can help classify the phenotypes, and uncover additional evidence for possible genetic risk factors. Bayesian networks, kernel-based classifiers and multivariate regression are making their way as candidate new methodologies for the analysis of GWAS data [6,7,8,9,10,11]

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Biology Direct	Publication Date: Jan 1, 2011
Citations: 70	License type: cc-by

R Discovery Prime

R Discovery Prime

Causal graph-based analysis of genome-wide association data in rheumatoid arthritis

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Biology Direct

Lead the way for us

Similar Papers

Early causal heterogeneity but developmental homogeneity before schizophrenia. A prospective study of early life in 15000 people from two general population birth cohorts
Peter Jones ... Matti Isohanni
-
Peter Jones, et. al.Peter Jones ... Matti Isohanni
01 Jan 1997
01 Jan 1997

Low levels of antibodies against common viruses associate with anti-citrullinated protein antibody-positive rheumatoid arthritis; implications for disease aetiology
Natalia Sherina ... Hulda S Hreggvidsdottir
Arthritis Research & Therapy | VOL. 19
Natalia Sherina, et. al.Natalia Sherina ... Hulda S Hreggvidsdottir
30 Sep 2017
Arthritis Research & Therapy | VOL. 19

SAT0033 Characterization of the Multi-Biomarker Disease Activity (Vectra Da™ Algorithm) Score in a Subgroup of Patients from the Epidemiological Investigation of Rheumatoid Arthritis (EIRA) Cohort Receiving Methotrexate
W Li ... A I Catrina
Annals of the Rheumatic Diseases | VOL. 72
W Li, et. al.W Li ... A I Catrina
01 Jun 2013
Annals of the Rheumatic Diseases | VOL. 72

POS0392 PRESENCE OF FOUR SERUM AUTOANTIBODIES ASSOCIATES WITH THE ACPA STATUS IN EARLY RHEUMATOID ARTHRITIS
L Lourido ... R Sjöberg
-
L Lourido, et. al.L Lourido ... R Sjöberg
19 May 2021
POS0392 PRESENCE OF FOUR SERUM AUTOANTIBODIES ASSOCIATES WITH THE ACPA STATUS IN EARLY RHEUMATOID ARTHRITIS
L Lourido ... R Sjöberg

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Causal graph-based analysis of genome-wide association data in rheumatoid arthritis

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Biology Direct