Eucalyptusresearch in the post-genome era

Georgios Pappas,Roberto C Togawa,Marilia Cr Pappas,Orzenil B Silva-Junior,Sergio De Alencar,Dario Grattapaglia

doi:10.1186/1753-6561-5-s7-i22

Georgios Pappas, Roberto C Togawa + Show 4 more

Open Access

https://doi.org/10.1186/1753-6561-5-s7-i22

Copy DOI

Abstract

With the efforts of the Joint Genome Institute (JGI) through the EUCAGEN Eucalyptus Genome Network reaching the final release of a Eucalyptus grandis reference genome (BRASUZ1), it is anticipated that this accomplishment will profoundly shape the future research of this global tree genus [1]. One of the first steps toward this end has been the refinement of the genome annotation. Robust models have been generated for protein-coding genes. Here we report the annotation of other pivotal genome constituents, transposable elements (TE) and micro RNA genes. TEs are not only the most dominant elements but are also major drivers of genome plasticity. Several bioinformatic strategies were employed to perform “de novo” and homology based prediction of repetitive elements [2]in the current genome release (version 1.0). A total of 53 distinct TE families could be identified, with the retrotransposon super-family being widely over-represented, as observed for the majority of plant genomes sequenced. Micro RNAs (miRNA), key players in post-transcriptional gene regulation, were annotated using a combination of massively parallel sequencing of small RNA libraries and a genome-wide computational screening to ascertain a compatible secondary structure of the precursor. Both experimental and “in silico” evidences enabled the annotation of 206 distinct miRNA loci comprising 36 different mir gene families including several miRNA isoforms. The blueprint provided by a high-quality reference genome, both in terms of sequence completeness and annotation, will leverage efforts to better characterize intra- and inter-specific sequence variation underlying the marked phenotypic differences among the hundreds of species comprising this genus. Recent advances of DNA sequencing technologies permit a comprehensive interrogation of several other individuals at a fraction of the cost. In this context, we have used Illumina (2x75bp) short read sequencing data of E. globulus clone X46 generated by JGI and made available through the EUCAGEN network to carry out two comparative genomics experiments. From 40X raw sequence data provided it was possible to use an equivalent of 20X coverage (~12Gbp). We first attempted to perform a de novo assembly using VELVET [3]. A total of 161,000 contigs were obtained the largest one sizing at ~3,5kb. In spite of the easy access and low cost of next generation sequencing technologies, these results suggest that even for relatively small forest tree genomes, current technical and computational limitations preclude comprehensive assembly, likely due to the ubiquitous occurrence of repetitive elements in such genomes. Nevertheless when we mapped the E. globulus sequencing data against the BRASUZ1 reference genome, 55% of the reads could be mapped with high confidence. From these, approximately 800,000 high quality single nucleotide polymorphisms (SNPs) could be identified clearly showing the key role that the reference genome will have for future genomic undertakings. The sheer number of molecular markers discovered in this experiment not only fosters more powerful studies on the evolutionary history and population genomics of eucalypts, but also inaugurates a new era in molecular breeding of species of this genus, providing genome-wide coverage for genomic selection and association studies.

Highlights

Micro RNAs, key players in post-transcriptional gene regulation, were annotated using a combination of massively parallel sequencing of small RNA libraries and a genome-wide computational screening to ascertain a compatible secondary structure of the precursor
We have used Illumina (2x75bp) short read sequencing data of E. globulus clone X46 generated by Joint Genome Institute (JGI) and made available through the EUCAGEN network to carry out two comparative genomics experiments
In spite of the easy access and low cost of generation sequencing technologies, these results suggest that even for relatively small forest tree genomes, current technical and computational limitations preclude comprehensive assembly, likely due to the ubiquitous occurrence of repetitive elements in such genomes

Summary

Introduction

Micro RNAs (miRNA), key players in post-transcriptional gene regulation, were annotated using a combination of massively parallel sequencing of small RNA libraries and a genome-wide computational screening to ascertain a compatible secondary structure of the precursor. Recent advances of DNA sequencing technologies permit a comprehensive interrogation of several other

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Eucalyptusresearch in the post-genome era

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Proceedings

Lead the way for us

Journal: BMC Proceedings	Publication Date: Sep 13, 2011
License type: CC BY 2.0

Similar Papers

The Chromosome-Level Reference Genome of Tea Tree Unveils Recent Bursts of Non-autonomous LTR Retrotransposons in Driving Genome Size Evolution
...
Molecular Plant | VOL. 13
, et. al. ...
27 Apr 2020
Molecular Plant | VOL. 13

Abstract 2033: Assessment of human reference genomes on cancer somatic mutation detection in tumor-normal paired reference samples using whole genome short-read sequencing data
Chunlin Xiao ... Valerie Schneider
Cancer Research | VOL. 83
Chunlin Xiao, et. al.Chunlin Xiao ... Valerie Schneider
04 Apr 2023
Cancer Research | VOL. 83

Systematic and single cell analysis of Xenopus Piwi-interacting RNAs and Xiwi
Nelson C Lau ... Mark Borowsky
The EMBO Journal | VOL. 28
Nelson C Lau, et. al.Nelson C Lau ... Mark Borowsky
27 Aug 2009
The EMBO Journal | VOL. 28

Long-read sequencing in ecology and evolution: Understanding how complex genetic and epigenetic variants shape biodiversity.
Dan G Bock ... Polina Novikova
Molecular ecology | VOL. 32
Dan G Bock, et. al.Dan G Bock ... Polina Novikova
01 Mar 2023
Molecular ecology | VOL. 32

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Eucalyptusresearch in the post-genome era

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Proceedings