Abstract

Premise: An informatics approach was used for the construction of an Axiom genotyping array from heterogeneous, high‐throughput sequence data to assess the complex genome of loblolly pine (Pinus taeda).

Methods: High‐throughput sequence data, sourced from exome capture and whole genome reduced‐representation approaches from 2698 trees across five sequence populations, were analyzed with the improved genome assembly and annotation for the loblolly pine. A variant detection, filtering, and probe design pipeline was developed to detect true variants across and within populations. From 8.27 million variants, a total of 642,275 were evaluated and 423,695 of those were screened across a range‐wide population.

Results: The final informatics and screening approach delivered an Axiom array representing 46,439 high‐confidence variants to the forest tree breeding and genetics community. Based on the annotated reference genome, 34% were located in or directly upstream or downstream of genic regions.

Discussion: The Pita50K array represents a genome‐wide resource developed from sequence data for an economically important conifer, loblolly pine. It uniquely integrates independent projects that assessed trees sampled across the native range. The challenges associated with the large and repetitive genome are addressed in the development of this resource.
