Sequence, structure and evolutionary analysis of cold shock domain proteins, a member of OB fold family.

Mohd Amir,Faizan Ahmad,Ravins Dohare,Md Imtaiyaz Hassan,Vijay Kumar,Asimul Islam

doi:10.1111/jeb.13382

Abstract

The cold shock domain (CSD) belongs to the oligosaccharide/oligonucleotide-binding fold superfamily which is highly conserved from prokaryotes to higher eukaryotes, and appears to function as RNA chaperones. CSD is involved in diverse cellular processes, including adaptation to low temperatures, nutrient stress, cellular growth and developmental processes. Structural Classification of Proteins (SCOP) database broadly classifies OB fold proteins into 18 different superfamilies, including nucleic acid-binding superfamily (NAB). The NAB is further divided into 17 families together with cold shock DNA-binding protein family (CSDB). The CSDB have more than 240000 sequences in UniProt database consisting of 32 domains including CSD. Among these domains, CSD is the second largest sequence contributor (>40398 sequences). Herein, we have systematically analysed the relative abundance and distribution of CSD proteins based on sequences, structures, repeats and gene ontology (GO) molecular functions in all domains of life. Analysis of sequence distribution suggesting that CSDs are largely found in bacteria (83-94%) with single CSD repeat. However, repeat distribution in eukaryota varies from 1 to 5 in combination with other auxiliary domain that makes CSD proteins functionally more diverse compared to the bacterial counterparts. Further, analysis of repeats distributions on evolutionary scale suggest that existence of CSD in multiple repeats is mainly driven through speciation, gene shuffling and gene duplication events.

Full Text