RetroPred: A tool for prediction, classification and extraction of non-LTR retrotransposons (LINEs & SINEs) from the genome by integrating PALS, PILER, MEME and ANN.

Pradeep Kumar Naik,Sumit Gupta,Vinay Kumar Mittal

doi:10.6026/97320630002263

Abstract

The problem of predicting non-long terminal repeats (LTR) like long interspersed nuclear elements (LINEs) and short interspersed nuclear elements (SINEs) from the DNA sequence is still an open problem in bioinformatics. To elevate the quality of annotations of LINES and SINEs an automated tool "RetroPred" was developed. The pipeline allowed rapid and thorough annotation of non-LTR retrotransposons. The non-LTR retrotransposable elements were initially predicted by Pairwise Aligner for Long Sequences (PALS) and Parsimonious Inference of a Library of Elementary Repeats (PILER). Predicted non-LTR elements were automatically classified into LINEs and SINEs using ANN based on the position specific probability matrix (PSPM) generated by Multiple EM for Motif Elicitation (MEME). The ANN model revealed a superior model (accuracy = 78.79 +/- 6.86 %, Q(pred) = 74.734 +/- 17.08 %, sensitivity = 84.48 +/- 6.73 %, specificity = 77.13 +/- 13.39 %) using four-fold cross validation. As proof of principle, we have thoroughly annotated the location of LINEs and SINEs in rice and Arabidopsis genome using the tool and is proved to be very useful with good accuracy. Our tool is accessible at http://www.juit.ac.in/RepeatPred/home.html.

Highlights

Long interspersed elements (LINEs) and short interspersed elements (SINEs) are non-long terminal repeats (LTR) retrotransposons that reside within cells of a host organism, copying and inserting themselves into the host genome
Repetitive sequences are an important feature of eukaryotic genomes accounting for a large proportion of the genome; at least 50% of the human [1] and about 80% in some plants [2] genome seems to be composed by repetitive elements
The ANN model develop in this study (200-7-2) is trained with the position specific probability matrix (PSPM) matrix calculated using Multiple EM for Motif Elicitation (MEME)

Summary

Introduction

Long interspersed elements (LINEs) and short interspersed elements (SINEs) are non-LTR retrotransposons that reside within cells of a host organism, copying and inserting themselves into the host genome. The annotation of genomic repeats, typically relies on the results of a single computational program, RepeatMasker

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Bioinformation	Publication Date: Jan 11, 2008
Citations: 5	License type: cc-by

R Discovery Prime

R Discovery Prime

RetroPred: A tool for prediction, classification and extraction of non-LTR retrotransposons (LINEs & SINEs) from the genome by integrating PALS, PILER, MEME and ANN.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Bioinformation

Lead the way for us

Similar Papers

Diversity of short interspersed nuclear elements (SINEs) in lepidopteran insects and evidence of horizontal SINE transfer between baculovirus and lepidopteran hosts
Guangjie Han ... Heng Jiang
BMC Genomics | VOL. 22
Guangjie Han, et. al.Guangjie Han ... Heng Jiang
31 Mar 2021
BMC Genomics | VOL. 22

PSVII-10 Genome-wide discovery and characterization of short interspersed nuclear elements (SINEs) in the bovine genome
Naisu Yang ... Antony T Vincent
Journal of Animal Science | VOL. 102
Naisu Yang, et. al.Naisu Yang ... Antony T Vincent
14 Sep 2024
Journal of Animal Science | VOL. 102

Analysis of the 227 bp short interspersed nuclear element (SINE) insertion of the promoter of the myostatin (MSTN) gene in different horse breeds.
... Marco Tassinari
Veterinaria italiana | VOL. 50
, et. al. ... Marco Tassinari
01 Feb 2014
Veterinaria italiana | VOL. 50

Recombinant SINEs are formed at high frequency during induced retrotransposition in vivo
Vijay Pal Yadav ... Prabhat Kumar Mandal
Nature Communications | VOL. 3
Vijay Pal Yadav, et. al.Vijay Pal Yadav ... Prabhat Kumar Mandal
01 Jan 2012
Nature Communications | VOL. 3

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

RetroPred: A tool for prediction, classification and extraction of non-LTR retrotransposons (LINEs & SINEs) from the genome by integrating PALS, PILER, MEME and ANN.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Bioinformation