Abstract

The biogeographical ancestry (BGA) information can provide supporting information in epidemiology and leading intelligence in forensics. Several sets of ancestral informative markers (AIM) have been proposed to facilitate the BGA inference. A small set of markers can improve efficiency though, it has limitations in their ability of balancing different populations and differentiating sub-populations. Genome-wide SNPs provide much more comprehensive information of an individual's ancestral information. In this paper, we study the problem of BGA inference under the abundance of genome-wide high density data. We studied 1043 individuals from 7 continental populations of the Human Genome Diversity Panel at 32212 genome-wide autosomal single nucleotide polymorphism (SNP) loci. We detected the population structure and compared the BGA inference accuracy using three widely used genetic sequence analysis algorithms through AIMs and genome-wide SNPs. Our results show that genome-wide SNPs reveal population structure with clearer clusterness and provide more accurate BGA inference, confirming the rich information carried by genome-wide SNPs. The findings help to give a clearer picture of candidate ancestral population groups of an individual, and potentially help the BGA inference in a fine population scale.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call