A Tutorial of EDTA: Extensive De Novo TE Annotator.

Weijia Su,Shujun Ou,Thomas Peterson,Matthew B Hufford

doi:10.1007/978-1-0716-1134-0_4

Abstract

Transposable elements (TEs) are important contributors to genome structure and evolution. With the growth of sequencing technologies, various computational pipelines and software programs have been developed to facilitate TE identification and annotation. These computational tools can be categorized into three types based on their underlying approach: homology-based, structural-based, and de novo methods. Each of these tools has advantages and disadvantages. In this chapter, we introduce EDTA (Extensive de novo TE Annotator), a new comprehensive pipeline composed of high-quality tools to identify and annotate all types of TEs. The development of EDTA is based on the benchmarking results of a collection of TE annotation methods. The selected programs are evaluated by their ability to identify true TEs as well as to exclude false candidates. Here, we present an overview of the EDTA pipeline and a detailed manual for its use. The source code of EDTA is available at https://github.com/oushujun/EDTA .

Full Text