Above and beyond state-of-the-art approaches to investigate sequence data: summary of methods and results from the population-based association group at the Genetic Analysis Workshop 19.

Justo Lorenzo Bermejo

doi:10.1186/s12863-015-0310-0

Abstract

This paper summarizes the contributions from the Population-Based Association group at the Genetic Analysis Workshop 19. It provides an overview of the new statistical approaches tried out by group members in order to take best advantage of population-based sequence data.Although contributions were highly heterogeneous regarding the applied quality control criteria and the number of investigated variants, several technical issues were identified, leading to practical recommendations. Preliminary analyses revealed that Hurdle-negative binomial regression is a promising approach to investigate the distribution of allele counts instead of called genotypes from sequence data. Convergence problems, however, limited the use of this approach, creating a technical challenge shared by environment-stratified models used to investigate rare variant-environment interactions, as well as by rare variant haplotype analyses using well-established public software. Estimates of relatedness and population structure strongly depended on the allele frequency of selected variants for inference. Another practical recommendation was that dissenting probability values from standard and small-sample tests of a particular hypothesis may reflect a lack of validity of large-sample approximations. Novel statistical approaches that integrate evolutionary information showed some advantage to detect weak genetic signals, and Bayesian adjustment for confounding was able to efficiently estimate causal genetic effects. Haplotype association methods may constitute a valuable complement of collapsing approaches for sequence data. This paper reports on the experience of members of the Population-Based Association group with several novel, promising approaches to preprocessing and analyzing sequence data, and to following up identified association signals.

Highlights

Every 2 years, participants of the Genetic Analysis Workshop (GAW) explore a common data set using novel approaches and summarize their findings in a short paper
Panel C represents a histogram of alternative allele counts for the variant in position Ch3:16249998, which presented a median count of 254 (40.03)
Panel D compares the distributions of the ratio “alternative allele count/number of reads” and called genotypes

Summary

Introduction

Every 2 years, participants of the Genetic Analysis Workshop (GAW) explore a common data set using novel approaches and summarize their findings in a short paper. Contributions to the GAW19, held August 24–27, 2014, in Vienna, Austria, were split up by workshop organizers into 9 thematic groups. Members of the Population-Based Association group worked in pairs in the weeks preceding the GAW. Each participant contacted the other pair member, read the preliminary version of his/her individual contribution, and discussed findings and results with him/her. On the Lorenzo Bermejo BMC Genetics 2016, 17(Suppl 2): first day of the workshop, group members briefly presented the contributions of the other pair member. Engaged discussions and intensive team work during 4 group meetings and a poster session led to a consensus summary of the group contributions, which was presented to all GAW participants in a plenary session

Methods

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: BMC Genetics	Publication Date: Feb 3, 2016
Citations: 8	License type: cc-by

R Discovery Prime

R Discovery Prime

Above and beyond state-of-the-art approaches to investigate sequence data: summary of methods and results from the population-based association group at the Genetic Analysis Workshop 19.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Genetics

Lead the way for us

Similar Papers

Real-Time Analysis and Visualization of Pathogen Sequence Data.
Richard A Neher ... Colleen Suzanne Kraft
Journal of Clinical Microbiology | VOL. 56
Richard A Neher, et. al.Richard A Neher ... Colleen Suzanne Kraft
25 Oct 2018
Journal of Clinical Microbiology | VOL. 56

Spectral analysis of phylogenetic data
Michael D Hendy ... David Penny
Journal of Classification | VOL. 10
Michael D Hendy, et. al.Michael D Hendy ... David Penny
01 Jan 1992
Journal of Classification | VOL. 10

Using next-generation DNA sequence data for genetic association tests based on allele counts with and without consideration of zero inflation.
Rosa González Silos ... Barbara Peil
BMC Proceedings | VOL. 10
Rosa González Silos, et. al.Rosa González Silos ... Barbara Peil
01 Oct 2016
BMC Proceedings | VOL. 10

Using single-worm RNA sequencing to study C. elegans responses to pathogen infection
Archer J Wang ... Phillip Wibisono
BMC Genomics | VOL. 23
Archer J Wang, et. al.Archer J Wang ... Phillip Wibisono
14 Sep 2022
BMC Genomics | VOL. 23

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Above and beyond state-of-the-art approaches to investigate sequence data: summary of methods and results from the population-based association group at the Genetic Analysis Workshop 19.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Genetics