Abstract

While large-scale genomic approaches are increasingly revealing the genetic basis of polymorphic phenotypes such as colour morphs, such approaches are almost exclusively conducted in species with high-quality genomes and annotations. Here, we use Pool-Seq data for both genome assembly and SNP frequency estimation, followed by scanning for FST outliers to identify divergent genomic regions. Using paired-end, short-read sequencing data from two groups of individuals expressing divergent phenotypes, we generate a de novo rough-draft genome, identify SNPs and calculate genomewide FST differences between phenotypic groups. As genomes generated by Pool-Seq data are highly fragmented, we also present an approach for super-scaffolding contigs using existing protein-coding data sets. Using this approach, we reanalysed genomic data from two recent studies of birds and butterflies investigating colour pattern variation and replicated their core findings, demonstrating the accuracy and power of a Pool-Seq-only approach. Additionally, we discovered new regions of high divergence and new annotations that together suggest novel parallels between birds and butterflies in the origins of their colour pattern variation.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call