Abstract

Attempts have been made to define the relationship among the three domains (eukaryotes, archaea, and eubacteria) using phylogenetic tree analyses of 16S rRNA sequences and of various protein sequences. Because the results are inconsistent, the eukaryotic genome is thought to have a chimeric structure. In our previous studies, which used the whole set of open reading frames (ORFs) of many genomes, we suggested that eukaryotes originated from the symbiosis of archaea within eubacteria. Those studies did not clarify which species participated in the symbiosis, nor did they address the effect of gene duplication after speciation (in-paralogs). To avoid the influence of in-paralogs, we developed a new method to identify orthologous ORFs. Furthermore, we separated eukaryotic in-paralogs into three groups by sequence similarity to archaea, eubacteria (other than alpha-proteobacteria), and alpha-proteobacteria, and treated each group as an individual organism. This analysis clarified the relationship between the three ORF groups and functional classification. Applying the new method to a gene-content-based phylogenetic tree analysis of 66 organisms (4 eukaryotes, 13 archaea, and 49 eubacteria) suggests that eukaryotes originated from the symbiosis of Pyrococcus within gamma-proteobacteria.
