Efficient Replication of Over 180 Genetic Associations with Self‐Reported Medical Data

Uta Francke, Chuong Do, Brian Naughton, Anne Wojcicki, Arnab Chowdry, Nicholas Eriksson, Joanna Naughton, Joyce Tung, J Michael Macpherson, David Hinds, Amy Kiefer

doi:10.1038/npre.2011.6014.1

Abstract

While the cost and speed of generating genomic data have come down dramatically in recent years, the slow pace of collecting medical data for large cohorts continues to hamper genetic research. Here we evaluate a novel online framework for amassing large amounts of medical information in a recontactable cohort by assessing our ability to replicate genetic associations using these data. Using web‐based questionnaires, we gathered self‐reported data on 50 medical phenotypes from a generally unselected cohort of over 20,000 genotyped individuals. Of a list of genetic associations curated by NHGRI, we successfully replicated about 75% of the associations that we expected to (based on the number of cases in our cohort and reported odds ratios, and excluding a set of associations with contradictory published evidence). Altogether we replicated over 180 previously reported associations, including many for type 2 diabetes, prostate cancer, cholesterol levels, and multiple sclerosis. We found significant variation across categories of conditions in the percentage of expected associations that we were able to replicate, which may reflect systematic inflation of the effects in some initial reports, or differences across diseases in the likelihood of misdiagnosis or misreport. We also demonstrated that we could improve replication success by taking advantage of our recontactable cohort, offering more in‐depth questions to refine self‐reported diagnoses. Our data suggest that online collection of self‐reported data in a recontactable cohort may be a viable method for both broad and deep phenotyping in large populations.

Highlights

In the last few years, the cost of collecting genomic data has declined rapidly
New techniques are needed to complement the wealth of genomic data and build the large cohorts needed for highly-powered genome-wide association studies (GWAS)
Phenotyping error decreases power, which can be problematic as most GWAS are not sufficiently powered to explain a significant fraction of the underlying heritability
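
To make the link between phenotyping error and power concrete, the following is a minimal sketch, not the authors' method. It assumes a simple allelic case-control test under a normal approximation, and it assumes that misreported cases carry the risk allele at the control frequency. The function names, parameters, and example numbers are hypothetical and chosen only for illustration.

```python
from math import sqrt
from statistics import NormalDist

_STD_NORMAL = NormalDist()  # standard normal, mean 0 and sd 1


def case_allele_freq(control_freq, odds_ratio):
    """Risk-allele frequency in true cases implied by an allelic odds ratio."""
    odds = odds_ratio * control_freq / (1.0 - control_freq)
    return odds / (1.0 + odds)


def replication_power(control_freq, odds_ratio, n_cases, n_controls,
                      alpha=0.05, misreport_rate=0.0):
    """Approximate power of a two-sided allelic test (illustrative only).

    misreport_rate is the assumed fraction of self-reported cases who are
    really non-cases; they dilute the case allele frequency toward the
    control frequency.
    """
    true_case_freq = case_allele_freq(control_freq, odds_ratio)
    observed_case_freq = ((1.0 - misreport_rate) * true_case_freq
                          + misreport_rate * control_freq)
    # Each person contributes two alleles, hence the factor of 2.
    std_err = sqrt(
        observed_case_freq * (1.0 - observed_case_freq) / (2 * n_cases)
        + control_freq * (1.0 - control_freq) / (2 * n_controls)
    )
    z_effect = abs(observed_case_freq - control_freq) / std_err
    z_crit = _STD_NORMAL.inv_cdf(1.0 - alpha / 2.0)
    return (_STD_NORMAL.cdf(z_effect - z_crit)
            + _STD_NORMAL.cdf(-z_effect - z_crit))


if __name__ == "__main__":
    # Hypothetical inputs: 30% control allele frequency, odds ratio 1.2.
    clean = replication_power(0.30, 1.2, n_cases=1000, n_controls=10000)
    noisy = replication_power(0.30, 1.2, n_cases=1000, n_controls=10000,
                              misreport_rate=0.2)
    print(f"power without misreport: {clean:.3f}")
    print(f"power with 20% misreport: {noisy:.3f}")
```

Under these assumptions, summing such power estimates over a list of published associations would give an expected number of replications, against which an observed count could be compared.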

Summary

Introduction

In the last few years, the cost of collecting genomic data has declined rapidly. However, advances in the collection of phenome data (the set of all phenotypic information from a single organism) have not kept pace [1,2]. There is a need for more straightforward methods to quickly and reliably gather retrospective phenotype information from large cohorts of people, not only to validate existing associations but also to discover new ones. We evaluate a research model in which a large, recontactable cohort is surveyed online across a broad range of phenotypes. Subsets of this cohort with particular characteristics can be contacted for further research with more in-depth phenotyping on specific topics as appropriate. By assessing our ability to replicate previously reported genetic associations across a wide range of conditions, we demonstrate that broad self-reported data collection online is useful for medically-related conditions as well. We show that the ability to recontact the cohort facilitates rapid refinement of phenotype characterization.

Journal: Nature Precedings	Publication Date: Jun 7, 2011
License type: CC BY 3.0

