Abstract

Since the completion of the Human Genome Project, the field of human genetics has been in great flux, largely due to technological advances in studying DNA sequence variation. Although community-wide adoption of statistical standards was key to the success of genome-wide association studies, similar standards have not yet been globally applied to the processing and interpretation of sequencing data. It has proven particularly challenging to pinpoint unequivocally disease variants in sequencing studies of polygenic traits. Here, we comment on a number of factors that may contribute to irreproducible claims of association in scientific literature and discuss possible steps that we can take towards cultural change.

Highlights

  • Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Universiteitsweg 100, 3584 CG, Utrecht, The Netherlands

  • The study of complex traits and disease had previously been limited to genetic linkage studies, typically laborious efforts limited to constructing linkage maps in families and powered for discovering highly penetrant variants

  • A single de novo event from one individual, even if the variant is functional, is insufficient evidence to claim a role in disease susceptibility [60], and extensive replication efforts will be necessary to determine whether the amyotrophic lateral sclerosis (ALS)-CREST association is real

Read more

Summary

The Evolution of Genetic Association Studies

The Human Genome Project [1] remains the largest scientific endeavor in the biological sciences, spanning thirteen years, requiring hundreds of researchers around the globe, and costing $3 billion. Reviews of candidate gene studies found that small sample sizes, population stratification issues, weak effects, and a lack of statistical evidence for the claimed associations were common [3,4,5,6] All of these problems contributed to the irreproducibility of initial findings; one review found that of 166 associations with more than two follow-up studies, only six (3.6%) replicated [3]. Geneticists could test millions of common variants across the genome for association with the trait of interest Systematic cataloguing of these variants [9] allowed for the design of genome-wide SNP arrays, allowing for low-cost capture of tens of thousands of variants in large samples.

Sources of Error and Bias in Genetic Research
Next-Generation Sequencing
Lacking Evidence
Findings
Conclusions
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call