Automated de novo identification of repeat sequence families in sequenced genomes.

Zhirong Bao, Sean R Eddy

doi:10.1101/gr.88502

https://doi.org/10.1101/gr.88502

Journal: Genome Research
Publication Date: Jul 19, 2002
Citations: 839
License type: cc-by-nc

Affiliation: Howard Hughes Medical Institute, Washington University School of Medicine, St. Louis

#Identification Of Families #Transposable Elements #Classification Of Families #Human Genome #Homologous Repeat #Homologous Element #Single Clustering #Homologous Families #Approach Of Clustering #Genomic Sequences

Abstract

Repetitive sequences make up a major part of eukaryotic genomes. We have developed an approach for the de novo identification and classification of repeat sequence families that is based on extensions to the usual approach of single linkage clustering of local pairwise alignments between genomic sequences. Our extensions use multiple alignment information to define the boundaries of individual copies of the repeats and to distinguish homologous but distinct repeat element families. When tested on the human genome, our approach was able to properly identify and group known transposable elements. The program, RECON, should be useful for first-pass automatic classification of repeats in newly sequenced genomes.
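
As a rough illustration of the baseline that the abstract calls "the usual approach", single linkage clustering can be sketched with a union-find structure: any two sequence elements connected by a pairwise alignment end up in the same family. This is a minimal sketch, not the authors' implementation; the function name `single_linkage_clusters`, the element labels, and the alignment pairs are all hypothetical, and the multiple-alignment extensions described in the abstract (boundary definition and splitting of homologous but distinct families) are not modeled here.

```python
from collections import defaultdict


def single_linkage_clusters(elements, alignments):
    """Group elements into families by single linkage.

    elements:   iterable of hashable element labels (hypothetical IDs).
    alignments: iterable of (a, b) pairs, each meaning that a local
                pairwise alignment was found between elements a and b.
    Returns a sorted list of sorted families.
    """
    # Each element starts as the root of its own family.
    parent = {e: e for e in elements}

    def find(x):
        # Follow parent links to the root, halving the path as we go.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    # One alignment is enough to merge two families (single linkage).
    for a, b in alignments:
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_b] = root_a

    families = defaultdict(list)
    for e in elements:
        families[find(e)].append(e)
    return sorted(sorted(members) for members in families.values())


# Illustrative input: e1-e2 and e2-e3 align, so e1 and e3 are grouped
# together even though no direct e1-e3 alignment is listed.
elements = ["e1", "e2", "e3", "e4", "e5"]
alignments = [("e1", "e2"), ("e2", "e3"), ("e4", "e5")]
families = single_linkage_clusters(elements, alignments)
print(families)
```

Because a single alignment merges two groups, this baseline chains together anything that is transitively connected, which is one reason related but distinct families can collapse into one cluster and why the abstract proposes using multiple alignment information on top of it.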
