Abstract
Next Generation Sequencing technologies significantly advance the development of molecular markers for molecular breeding. Dedicated NGS data-analysis procedures must be developed for de novo reference assembly and SNP discovery in crop species without a reference genome sequence. In outcrossing fodder crops, the high degree of polymorphism hampers de novo assembly, contig clustering, read mapping, and SNP discovery. Using selected candidate genes as case studies, we illustrate the reconstruction of a reference transcript sequence from RNA-seq data from multiple genotypes, we validate de novo transcript assembly by Sanger sequencing, and analyse how read mapping and SNP discovery parameters determine sensitivity and specificity during SNP discovery. Thus, we propose a general strategy to construct a non-redundant reference transcriptome for crops without a sequenced genome, using predicted proteins from a closely related model species as a guidance for clustering and annotation. This reference transcriptome is required for candidate gene discovery and exome-wide identification of polymorphisms.
Published Version
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.