Genome-wide divergence, haplotype distribution and population demographic histories for Gossypium hirsutum and Gossypium barbadense as revealed by genome-anchored SNPs

Umesh K Reddy,Tariq Shehzad,Andrew H Paterson,Justin T Page,James Frelichowski,Dong Zhang,Venkata Lakshmi Abburi,C V C M Reddy,John Z Yu,Joshua A Udall,Padma Nimmakayala,Richard G Percy,Thangasamy Saminathan

doi:10.1038/srep41285

Abstract

Use of 10,129 singleton SNPs of known genomic location in tetraploid cotton provided unique opportunities to characterize genome-wide diversity among 440 Gossypium hirsutum and 219 G. barbadense cultivars and landrace accessions of widespread origin. Using the SNPs distributed genome-wide, we examined genetic diversity, haplotype distribution and linkage disequilibrium patterns in the G. hirsutum and G. barbadense genomes to clarify population demographic history. Diversity and identity-by-state analyses revealed little sharing of alleles between the two cultivated allotetraploid genomes, with a few exceptions that indicated sporadic gene flow. We found a high number of new alleles, representing increased nucleotide diversity, on chromosomes 1 and 2 in cultivated G. hirsutum as compared with low nucleotide diversity on these chromosomes in landrace G. hirsutum. In contrast, G. barbadense showed negative Tajima’s D on several chromosomes for both cultivated and landrace types, which indicates that the speciation of G. barbadense itself might have occurred with relatively narrow genetic diversity. The presence of conserved linkage disequilibrium (LD) blocks and haplotypes between G. hirsutum and G. barbadense provides strong evidence for comparable patterns of evolution in their domestication processes. Our study illustrates the potential use of population genetic techniques to identify genomic regions associated with domestication.

Highlights

Cultivars of G. hirsutum and G. barbadense produce the overwhelming majority of the world’s cotton fiber and oil
This report focuses on a comparative study of linkage disequilibrium (LD) among G. hirsutum and G. barbadense chromosomes and constraints in population structure of both allopolyploids
A representative sample of 658 cotton accessions (440 of G. hirsutum and 218 of G. barbadense), collected from 85 countries in North America, South America, Europe, Asia, and Africa, was obtained from the National Cotton Germplasm Collection (NCGC) maintained by the USDA-ARS in College Station, TX[34] (Table S1)

Summary

Materials and Methods

The separate TagCounts files were merged to form a “master” TagCounts file, which retained only those tags present at or above an experiment-wide minimum count. This master tag list was aligned to the TM-1 (G. hirsutum) reference genome[33], and a Tags On Physical Map (TOPM) file was generated, containing the genomic position of each tag with a unique, best alignment. The information recorded in the TOPM and the Tags By Taxa (TBT) file was used to discover SNPs at each “TagLocus” (the set of tags with the same genomic position) and to filter the SNPs based on the proportion of taxa covered by the TagLocus and on minor allele frequency[40]. To compute linkage disequilibrium (LD), we used the expectation-maximization (EM) algorithm, formalized in [46], which is an iterative technique for obtaining maximum likelihood estimates of sample haplotype frequencies.
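As an illustration of the EM step, the following is a minimal sketch for a single pair of biallelic SNPs. It is not the authors' software. It assumes unphased genotypes coded as 0, 1 or 2 copies of the alternate allele, random pairing of haplotypes within individuals, and no missing data. The function names `em_haplotype_freqs` and `ld_r2` are hypothetical. Only double heterozygotes have an ambiguous phase, so the E-step splits them between the two possible haplotype pairs in proportion to the current frequency estimates.

```python
from collections import Counter

# Allele pairs implied by a genotype code (0/1/2 copies of the alternate allele).
ALLELES = {0: (0, 0), 1: (0, 1), 2: (1, 1)}


def em_haplotype_freqs(genotypes, n_iter=200, tol=1e-10):
    """Estimate two-locus haplotype frequencies from unphased genotypes.

    genotypes: list of (g1, g2) tuples, each value in {0, 1, 2}.
    Returns a dict keyed by "00", "01", "10", "11", where the first
    character is the allele at locus 1 and the second the allele at locus 2.
    """
    counts = Counter(genotypes)
    n_hap = 2 * len(genotypes)
    base = {"00": 0.0, "01": 0.0, "10": 0.0, "11": 0.0}

    # Haplotypes are fully determined unless both loci are heterozygous.
    for (g1, g2), c in counts.items():
        if (g1, g2) == (1, 1):
            continue
        a, b = ALLELES[g1], ALLELES[g2]
        base[f"{a[0]}{b[0]}"] += c
        base[f"{a[1]}{b[1]}"] += c

    n_dh = counts[(1, 1)]  # double heterozygotes: phase is ambiguous
    freqs = {h: 0.25 for h in base}  # uniform starting point

    for _ in range(n_iter):
        # E-step: probability that a double heterozygote is 00/11 rather than 01/10.
        cis = freqs["00"] * freqs["11"]
        trans = freqs["01"] * freqs["10"]
        w = cis / (cis + trans) if (cis + trans) > 0 else 0.5

        # M-step: update frequencies from expected haplotype counts.
        new = {
            "00": (base["00"] + n_dh * w) / n_hap,
            "11": (base["11"] + n_dh * w) / n_hap,
            "01": (base["01"] + n_dh * (1 - w)) / n_hap,
            "10": (base["10"] + n_dh * (1 - w)) / n_hap,
        }
        delta = max(abs(new[h] - freqs[h]) for h in new)
        freqs = new
        if delta < tol:
            break
    return freqs


def ld_r2(freqs):
    """Squared correlation (r^2) between two loci from haplotype frequencies."""
    p1 = freqs["10"] + freqs["11"]  # alternate allele frequency at locus 1
    p2 = freqs["01"] + freqs["11"]  # alternate allele frequency at locus 2
    denom = p1 * (1 - p1) * p2 * (1 - p2)
    if denom == 0:
        return float("nan")  # undefined when either locus is monomorphic
    d = freqs["11"] - p1 * p2
    return d * d / denom


if __name__ == "__main__":
    sample = [(0, 0)] * 4 + [(2, 2)] * 4 + [(1, 1)] * 2
    est = em_haplotype_freqs(sample)
    print(est, ld_r2(est))
```

In this toy sample the homozygous individuals carry only the "00" and "11" haplotypes, so the EM iterations assign the double heterozygotes almost entirely to the 00/11 phase and the estimated r^2 approaches 1.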

Results and Discussion

Author Contributions

Additional Information

Journal: Scientific Reports	Publication Date: Jan 27, 2017
Citations: 11	License type: open-access
