To elucidate the population history of the Caucasus, we conducted a survey of genetic diversity in Samegrelo (Mingrelia), western Georgia. We collected DNA samples and genealogical information from 485 individuals residing in 30 different locations, the vast majority of whom being Mingrelian speaking. From these DNA samples, we generated mitochondrial DNA (mtDNA) control region sequences for all 485 participants (female and male), Y-short tandem repeat haplotypes for the 372 male participants, and analyzed all samples at nearly 590,000 autosomal single nucleotide polymorphisms (SNPs) plus around 33,000 on the sex chromosomes, with 27,000 SNP removed for missingness, using the GenoChip 2.0+ microarray. The resulting data were compared with those from populations from Anatolia, the Caucasus, the Near East, and Europe. Overall, Mingrelians exhibited considerable mtDNA haplogroup diversity, having high frequencies of common West Eurasian haplogroups (H, HV, I, J, K, N1, R1, R2, T, U, and W. X2) and low frequencies of East Eurasian haplogroups (A, C, D, F, and G). From a Y-chromosome standpoint, Mingrelians possessed a variety of haplogroups, including E1b1b, G2a, I2, J1, J2, L, Q, R1a, and R1b. Analysis of autosomal SNP data further revealed that Mingrelians are genetically homogeneous and cluster with other modern-day South Caucasus populations. When compared with ancient DNA samples from Bronze Age archaeological contexts in the broader region, these data indicate that the Mingrelian gene pool began taking its current form at least by this period, probably in conjunction with the formation of a distinct linguistic community.
Read full abstract