Tandemly arranged and dispersed repetitive DNA sequences are important structural and functional elements that make up a significant portion of vertebrate genomes. Using high throughput, low coverage whole genome sequencing followed by bioinformatics analysis, we have identified seven major tandem repetitive DNAs and two fragments of LTR retrotransposons in the genome of the Nile crocodile (Crocodylus niloticus, 2n = 32). The repeats showed great variability in structure, genomic organization, and chromosomal distribution as revealed by fluorescence in situ hybridization (FISH). We found that centromeric and pericentromeric heterochromatin of C. niloticus is composed of previously described in Crocodylus siamensis CSI-HindIII and CSI-DraI repetitive sequence families, a satellite revealed in Crocodylus porosus, and additionally contains at least three previously unannotated tandem repeats. Both LTR sequences identified here belong to the ERV1 family of endogenous retroviruses. Each pericentromeric region was characterized by a diverse set of repeats, with the exception of chromosome pair 4, in which we found only one type of satellite. Only a few repeats showed non-centromeric signals in addition to their centromeric localization. Mapping of 18S-28S ribosomal RNA genes and telomeric sequences (TTAGGG)n did not demonstrate any co-localization of these sequences with revealed centromeric and pericentromeric heterochromatic blocks.
Read full abstract