Abstract

BackgroundSireviruses are an ancient genus of the Copia superfamily of LTR retrotransposons, and the only one that has exclusively proliferated within plant genomes. Based on experimental data and phylogenetic analyses, Sireviruses have successfully infiltrated many branches of the plant kingdom, extensively colonizing the genomes of grass species. Notably, it was recently shown that they have been a major force in the make-up and evolution of the maize genome, where they currently occupy ~21% of the nuclear content and ~90% of the Copia population. It is highly likely, therefore, that their life dynamics have been fundamental in the genome composition and organization of a plethora of plant hosts. To assist studies into their impact on plant genome evolution and also facilitate accurate identification and annotation of transposable elements in sequencing projects, we developed MASiVEdb (Mapping and Analysis of SireVirus Elements Database), a collective and systematic resource of Sireviruses in plants.DescriptionTaking advantage of the increasing availability of plant genomic sequences, and using an updated version of MASiVE, an algorithm specifically designed to identify Sireviruses based on their highly conserved genome structure, we populated MASiVEdb (http://bat.infspire.org/databases/masivedb/) with data on 16,243 intact Sireviruses (total length >158Mb) discovered in 11 fully-sequenced plant genomes. MASiVEdb is unlike any other transposable element database, providing a multitude of highly curated and detailed information on a specific genus across its hosts, such as complete set of coordinates, insertion age, and an analytical breakdown of the structure and gene complement of each element. All data are readily available through basic and advanced query interfaces, batch retrieval, and downloadable files. A purpose-built system is also offered for detecting and visualizing similarity between user sequences and Sireviruses, as well as for coding domain discovery and phylogenetic analysis.ConclusionMASiVEdb is currently the most comprehensive directory of Sireviruses, and as such complements other efforts in cataloguing plant transposable elements and elucidating their role in host genome evolution. Such insights will gradually deepen, as we plan to further improve MASiVEdb by phylogenetically mapping Sireviruses into families, by including data on fragments and solo LTRs, and by incorporating elements from newly-released genomes.

Highlights

  • Sireviruses are an ancient genus of the Copia superfamily of long terminal repeat (LTR) retrotransposons, and the only one that has exclusively proliferated within plant genomes

  • This has left as yet open questions on the position of Sireviruses within the LTR retrotransposon order, and whether they should be considered as viruses

  • Sequence search We consider the sequence similarity-based access to the Sirevirus data among the most important aspects of MASiVEdb, for which we developed an integrated system collectively termed ‘LTRphyler’, and linked it to the database sequences

Read more

Summary

Conclusion

MASiVEdb is so far the most comprehensive directory for Sireviruses, an abundant and distinctive genus of plant LTR retrotransposons, in currently available fullysequenced plant genomes. Additional file 1: Phylogenetic and genome structure analyses within the Copia superfamily This file contains the composite Additional file 1: Figure S1 that shows i) the phylogenetic relationships (based on the RT core domain) of exemplars from all three Copia genera, and ii) the highly conserved genome organization of Sireviruses and its comparison with the genome of other non-Sirevirus Copia elements (Figure adapted from [24]). Author details 1Institute of Agrobiotechnology, Centre for Research and Technology Hellas, Thessaloniki 57001, Greece. Author’s contributions AB conceived the study, collected and curated data, drafted the manuscript. NK and MP assisted in data collection and algorithm development. AT curated data and assisted in algorithm development. ND coordinated the study, developed algorithms, and drafted the manuscript.

Background
Discussion
Bennetzen JL
Findings
18. Volff JN
35. Bousios A
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call