Abstract

High-throughput SNP genotyping has become a precondition to move to higher precision and wider genome coverage genetic analysis of natural and breeding populations of non-model species. We developed a 44,318 annotated SNP catalog for Araucaria angustifolia, a grandiose subtropical conifer tree, one of the only two native Brazilian gymnosperms, critically endangered due to its valuable wood and seeds. Following transcriptome assembly and annotation, SNPs were discovered from RNA-seq and pooled RAD-seq data. From the SNP catalog, an Axiom® SNP array with 3,038 validated SNPs was developed and used to provide a comprehensive look at the genetic diversity and structure of 15 populations across the natural range of the species. RNA-seq was a far superior source of SNPs when compared to RAD-seq in terms of conversion rate to polymorphic markers on the array, likely due to the more efficient complexity reduction of the huge conifer genome. By matching microsatellite and SNP data on the same set of A. angustifolia individuals, we show that SNPs reflect more precisely the actual genome-wide patterns of genetic diversity and structure, challenging previous microsatellite-based assessments. Moreover, SNPs corroborated the known major north-south genetic cline, but allowed a more accurate attribution to regional versus among-population differentiation, indicating the potential to select ancestry-informative markers. The availability of a public, user-friendly 3K SNP array for A. angustifolia and a catalog of 44,318 SNPs predicted to provide ~29,000 informative SNPs across ~20,000 loci across the genome, will allow tackling still unsettled questions on its evolutionary history, toward a more comprehensive picture of the origin, past dynamics and future trend of the species' genetic resources. Additionally, but not less importantly, the SNP array described, unlocks the potential to adopt genomic prediction methods to accelerate the still very timid efforts of systematic tree breeding of A. angustifolia.

Highlights

  • The development of high throughput genotyping tools based on large numbers of single-nucleotide polymorphisms (SNPs) markers has become a prerequisite to move to a higher level of precision and genome coverage for the genetic analysis of natural and breeding populations of non-model organisms [1]

  • We have developed the first comprehensive SNP resource for Araucaria angustifolia, a keystone subtropical conifer tree, critically endangered due to its valuable wood and seeds

  • Data obtained with this newly developed SNP genotyping platform provided a comprehensive look at the range-wide genetic diversity and structure of the species

Read more

Summary

Introduction

The development of high throughput genotyping tools based on large numbers of SNP markers (single nucleotide polymorphisms) has become a prerequisite to move to a higher level of precision and genome coverage for the genetic analysis of natural and breeding populations of non-model organisms [1]. Other methods have adopted different approaches of restriction enzyme digestion followed by high throughput sequencing such as Restriction site associated DNA (RAD-seq) [4] the various alternative protocols of genotyping by sequencing (GbS) [5], and targeted enrichment by sequence capture [6]. These methods have allowed the discovery of large numbers of SNPs, and direct SNP genotyping at accessible costs for under resourced plant and animal species [7]

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call