Polyploidy is an important driving force in speciation and evolution; however, the genomic basis for parallel selection of a particular trait between polyploids and ancestral diploids remains unexplored. Here we construct graph-based pan-genomes for diploid (A2) and allotetraploid (AD1) cotton species, enabled by an assembly of 50 genomes of genetically diverse accessions. We delineate a mosaic genome map of tetraploid cultivars that illustrates genomic contributions from semi-wild forms into modern cultivars. Pan-genome comparisons identify syntenic and hyper-divergent regions of continued variation between diploid and tetraploid cottons, and suggest an ongoing process of sequence evolution potentially linked to the contrasting genome size change in two subgenomes. We highlight 43% of genetic regulatory relationships for gene expression in diploid encompassing sequence divergence after polyploidy, and specifically characterize six underexplored convergent genetic loci contributing to parallel selection of fiber quality. This study offers a framework for pan-genomic dissection of genetic regulatory components underlying parallel selection of desirable traits in organisms.
Read full abstract