Abstract

Teosinte (Zea mays ssp. parviglumis), the wild progenitor of maize (Zea mays L.), is an important germplasm resource for the improvement of modern maize lines. However, genetic and genomic information about teosinte is limited, and state-of-the-art tools for annotating transcriptomes assembled by single-molecule long-read sequencing without a reference genome are lacking. Here, we employed single-molecule long-read sequencing of cDNA libraries from five tissues of the teosinte inbred line TIL11 and identified 70,044 nonredundant transcript isoforms. We devised a state-of-the-art, machine learning-based bioinformatics pipeline, DenovoAS_Finder, to annotate the TIL11 transcriptome without a complete reference genome with an accuracy of up to 91%, providing a robust gene classifier for complex genomes. Additionally, we constructed a draft TIL11 genome with 16,633 high-quality contigs and an N50 of 112 kb by Nanopore sequencing. Genes from families that expanded from teosinte to maize were significantly enriched in the gene ontology (GO) term "RNA modification pathway" and had more transcript isoforms in TIL11 than in the maize inbred line B73. Genes showed collinearity between TIL11 and B73, whereas intergenic regions were extensively altered by transposable elements. Our study furthers the understanding of maize domestication and provides a resource for the utilization of wild germplasm in maize breeding.
