Enhancements to the ADMIXTURE algorithm for individual ancestry estimation

David H Alexander,Kenneth Lange

doi:10.1186/1471-2105-12-246

David H Alexander, Kenneth Lange

Open Access

https://doi.org/10.1186/1471-2105-12-246

Copy DOI

Abstract

BackgroundThe estimation of individual ancestry from genetic data has become essential to applied population genetics and genetic epidemiology. Software programs for calculating ancestry estimates have become essential tools in the geneticist's analytic arsenal.ResultsHere we describe four enhancements to ADMIXTURE, a high-performance tool for estimating individual ancestries and population allele frequencies from SNP (single nucleotide polymorphism) data. First, ADMIXTURE can be used to estimate the number of underlying populations through cross-validation. Second, individuals of known ancestry can be exploited in supervised learning to yield more precise ancestry estimates. Third, by penalizing small admixture coefficients for each individual, one can encourage model parsimony, often yielding more interpretable results for small datasets or datasets with large numbers of ancestral populations. Finally, by exploiting multiple processors, large datasets can be analyzed even more rapidly.ConclusionsThe enhancements we have described make ADMIXTURE a more accurate, efficient, and versatile tool for ancestry estimation.

Highlights

The estimation of individual ancestry from genetic data has become essential to applied population genetics and genetic epidemiology
The effectiveness of cross-validation Figure 1 demonstrates the effectiveness of cross-validation on several datasets culled from HapMap 3 [10]
While we have not performed extensive simulation studies, our experience has shown that the success of cross-validation depends in part on the degree of differentiation between the populations under study as quantified by Wright’s fixation index FST

Summary

Results

We describe four enhancements to ADMIXTURE, a high-performance tool for estimating individual ancestries and population allele frequencies from SNP (single nucleotide polymorphism) data. ADMIXTURE can be used to estimate the number of underlying populations through cross-validation. Individuals of known ancestry can be exploited in supervised learning to yield more precise ancestry estimates. By penalizing small admixture coefficients for each individual, one can encourage model parsimony, often yielding more interpretable results for small datasets or datasets with large numbers of ancestral populations. By exploiting multiple processors, large datasets can be analyzed even more rapidly

Background

Implementation

Results and Discussion

Conclusion

Wold S

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: BMC Bioinformatics	Publication Date: Jun 18, 2011
Citations: 1059	License type: CC BY 2.0

R Discovery Prime

R Discovery Prime

Enhancements to the ADMIXTURE algorithm for individual ancestry estimation

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Bioinformatics

Lead the way for us

Similar Papers

Fast individual ancestry inference from DNA sequence data leveraging allele frequencies for multiple populations.
Vikas Bansal ... Ondrej Libiger
BMC Bioinformatics | VOL. 16
Vikas Bansal, et. al.Vikas Bansal ... Ondrej Libiger
16 Jan 2015
BMC Bioinformatics | VOL. 16

Examining population stratification via individual ancestry estimates versus self-reported race.
Jill S Barnholtz-Sloan ... Thomas A Sellers
Cancer epidemiology, biomarkers & prevention : a publication of the American Association for Cancer Research, cosponsored by the American Society of Preventive Oncology | VOL. 14
Jill S Barnholtz-Sloan, et. al.Jill S Barnholtz-Sloan ... Thomas A Sellers
01 Jun 2005
01 Jun 2005

Population analysis of vitamin D receptor polymorphisms and the role of genetic ancestry in an admixed population
Tulio C Lins ... Rodrigo G Vieira
Genetics and Molecular Biology | VOL. 34
Tulio C Lins, et. al.Tulio C Lins ... Rodrigo G Vieira
01 Jan 2010
Genetics and Molecular Biology | VOL. 34

Informativeness of dental morphology in ancestry estimation in African Americans.
Jessica M Gross ... Heather J H Edgar
American Journal of Physical Anthropology | VOL. 168
Jessica M Gross, et. al.Jessica M Gross ... Heather J H Edgar
12 Jan 2019
American Journal of Physical Anthropology | VOL. 168

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Enhancements to the ADMIXTURE algorithm for individual ancestry estimation

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Bioinformatics