Global analysis of repetitive DNA from unassembled sequence reads using RepeatExplorer2.

Petr Novák, Jiří Macas, Pavel Neumann

doi:10.1038/s41596-020-0400-y

Abstract

RepeatExplorer2 is a new version of a computational pipeline that uses graph-based clustering of next-generation sequencing reads to characterize repetitive DNA in eukaryotes. The clustering algorithm facilitates repeat identification in any genome from relatively small quantities of short sequence reads, and additional tools within the pipeline automatically annotate and quantify the identified repeats. The pipeline is integrated into the Galaxy platform, which provides a user-friendly web interface for script execution and documentation of the results. Compared to the original version of the pipeline, RepeatExplorer2 provides automated annotation of transposable elements, identification of tandem repeats and enhanced visualization of analysis results. Here, we present an overview of the RepeatExplorer2 workflow and provide procedures for its application to (i) de novo repeat identification in a single species, (ii) comparative repeat analysis in a set of species, (iii) development of satellite DNA probes for cytogenetic experiments and (iv) identification of centromeric repeats based on ChIP-seq data. Each procedure takes approximately 2 days to complete. RepeatExplorer2 is available at https://repeatexplorer-elixir.cerit-sc.cz.
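
The idea of graph-based read clustering mentioned in the abstract can be illustrated with a toy sketch. This is not the RepeatExplorer2 implementation: the similarity criterion (shared k-mers), the thresholds `k` and `min_shared`, the use of plain connected components, and the function names are all assumptions chosen for brevity. It only shows how reads from a repeated sequence tend to fall into one graph cluster, and how the share of reads in a cluster can serve as a rough estimate of that repeat's abundance.

```python
from collections import defaultdict
from itertools import combinations


def kmers(read, k):
    """Return the set of all k-mers contained in a read."""
    return {read[i:i + k] for i in range(len(read) - k + 1)}


def cluster_reads(reads, k=8, min_shared=3):
    """Group reads into connected components of a similarity graph.

    Toy criterion (an assumption, not the pipeline's): two reads are
    connected when they share at least `min_shared` k-mers.
    Returns lists of read indices, largest cluster first.
    """
    kmer_sets = [kmers(r, k) for r in reads]
    parent = list(range(len(reads)))  # union-find forest

    def find(i):
        # Follow parent links to the root, compressing the path on the way.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    # All-against-all comparison; acceptable only for tiny examples.
    for a, b in combinations(range(len(reads)), 2):
        if len(kmer_sets[a] & kmer_sets[b]) >= min_shared:
            parent[find(a)] = find(b)

    clusters = defaultdict(list)
    for i in range(len(reads)):
        clusters[find(i)].append(i)
    return sorted(clusters.values(), key=len, reverse=True)


def read_proportions(clusters, n_reads):
    """Fraction of all reads that fall into each cluster."""
    return [len(c) / n_reads for c in clusters]


if __name__ == "__main__":
    # Hypothetical data: three overlapping reads from a tandem repeat
    # plus one unrelated read.
    satellite = "ACGTTGCAGGCTAAGCTTCA" * 3
    reads = [
        satellite[0:30],
        satellite[5:35],
        satellite[10:40],
        "TTTTTTTTTTGGGGGGGGGGCCCCCCCCCC",
    ]
    clusters = cluster_reads(reads)
    print(clusters)
    print(read_proportions(clusters, len(reads)))
```

In this example the three reads sampled from the tandem repeat end up in one cluster and the unrelated read in its own. A real analysis works with very large numbers of reads and needs an indexed similarity search and a proper community-detection step rather than the quadratic comparison used here.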

Journal: Nature Protocols	Publication Date: Oct 23, 2020
Citations: 177

Similar Papers

Identification of polymorphic tandem repeats by direct comparison of genome sequence from different bacterial strains : a web-based resource
France Denœud ... Gilles Vergnaud
BMC Bioinformatics | VOL. 5
01 Jan 2004

Uncovering the dark matter of the metagenome one read at a time
Nicholas Dimonaco ... Kim Kenobi
Access Microbiology | VOL. 1
01 Mar 2019

G-SNPM - A GPU-based SNP mapping tool
Alessandro Orro ... Andrea Manconi
EMBnet.journal | VOL. 18
09 Nov 2012

Quaternionic periodicity transform: an algebraic solution to the tandem repeat detection problem
Andrzej K Brodzik
Bioinformatics | VOL. 23
19 Jan 2007
