Abstract

Several monosaccharides constitute naturally occurring glycans, but it is uncertain whether they constitute a universal set like the alphabets of proteins and DNA. Based on the available experimental observations, it is hypothesized herein that the glycan alphabet is not universal. Data on the presence/absence of pathways for the biosynthesis of 55 monosaccharides in 12 939 completely sequenced archaeal and bacterial genomes are presented in support of this hypothesis. Pathways were identified by searching for homologues of biosynthesis pathway enzymes. Substantial variations were observed in the set of monosaccharides used by organisms belonging to the same phylum, genera and even species. Monosaccharides were grouped as common, less common and rare based on their prevalence in Archaea and Bacteria. It was observed that fewer enzymes are sufficient to biosynthesize monosaccharides in the common group. It appears that the common group originated before the formation of the three domains of life. In contrast, the rare group is confined to a few species in a few phyla, suggesting that these monosaccharides evolved much later. Fold conservation, as observed in aminotransferases and SDR (short-chain dehydrogenase reductase) superfamily members involved in monosaccharide biosynthesis, suggests neo- and sub-functionalization of genes led to the formation of the rare group monosaccharides. The non-universality of the glycan alphabet begets questions about the role of different monosaccharides in determining an organism’s fitness.

Highlights

  • Living organisms show enormous diversity in organization, size, morphology, habitat, etc., but are unified by the highly conserved processes of central dogma: replication, transcription and translation

  • We have identified the prevalence of 55 monosaccharide biosynthesis pathways in 12939 completely sequenced archaeal and bacterial genomes by searching for homologs of biosynthesis pathway enzymes using Hidden Markov Model (HMM) profiles, and in a few cases, BLASTp

  • Glycan alphabet size is not the same across Archaea and across Bacteria: The number of monosaccharides used by different species is significantly different (Figure 1) and is independent of proteome size (Figure S3)

Read more

Summary

Introduction

Living organisms show enormous diversity in organization, size, morphology, habitat, etc., but are unified by the highly conserved processes of central dogma: replication, transcription and translation. The enormous diversity seen in life forms is encoded by DNA and decoded primarily by proteins. Both DNA and proteins use the same set of building blocks (nucleotide bases and amino acids, respectively) in all organisms; yet, they store the requisite information by merely varying the (i) set/subset of building blocks used, (ii) number of times each building block is used and (iii) sequence in which the building blocks are linked [collectively referred to as the ‘sequence’ (Table 1)]. The question is, do glycans use the same set of building blocks (viz., monosaccharides) in all organisms, the way proteins and nucleic acids do?

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call