Abstract

Inteins are self-splicing protein elements found in viruses and all three domains of life. How the DNA encoding these selfish elements spreads within and between genomes is poorly understood, particularly in eukaryotes where inteins are scarce. Here, we show that the nuclear genomes of three strains of Anaeramoeba encode between 45 and 103 inteins, in stark contrast to four found in the most intein-rich eukaryotic genome described previously. The Anaeramoeba inteins reside in a wide range of proteins, only some of which correspond to intein-containing proteins in other eukaryotes, prokaryotes, and viruses. Our data also suggest that viruses have contributed to the spread of inteins in Anaeramoeba and the colonization of new alleles. The persistence of Anaeramoeba inteins might be partly explained by intragenomic movement of intein-encoding regions from gene to gene. Our intein dataset greatly expands the spectrum of intein-containing proteins and provides insights into the evolution of inteins in eukaryotes.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call