The Danio rerio (zebrafish) gene LOC791917 encodes a protein annotated in the UniProt knowledgebase [1] as the “middle domain of eukaryotic initiation factor 4G domain containing protein b” (MIF4Gdb). The protein comprises 222 amino acid residues and has a molecular weight of 25.8 kDa. BLAST searches revealed homologues of D. rerio MIF4Gdb in many eukaryotes, including humans [2]. MIF4Gdb and its homologues were identified as members of the Pfam family MIF4G (PF02854), which is named after the middle domain of eukaryotic initiation factor 4G (eIF4G) [3-5]. eIF4G is a component of the eukaryotic translation initiation complex and contains binding sites for other initiation factors, suggesting a critical role in translation initiation [6]. The MIF4G domain also occurs in several other proteins involved in RNA metabolism, including the nonsense-mediated mRNA decay 2 protein (NMD2/UPF2) and the nuclear cap-binding protein 80-kDa subunit (CBP80) [5]. Sequence and structure analyses of the MIF4G domains in many proteins indicate that the domain adopts an all-helical fold and contains tandemly repeated motifs [5,7]. The zebrafish protein described here is homologous to domains of other proteins that are variously referred to as NIC-containing proteins (NMD2, eIF4G, CBP80). The biological function of D. rerio MIF4Gdb has not yet been experimentally characterized, and its annotation is based solely on amino acid sequence comparison. D. rerio MIF4Gdb shared no more than 25% sequence identity with any protein of known three-dimensional structure and was selected as a target for structure determination by the Center for Eukaryotic Structural Genomics (CESG). Here, we report the crystal structure of D. rerio MIF4Gdb (UniGene code Dr.79360, UniProt code Q5EAQ1, CESG target number GO.79294).