Abstract

Environmental sequence data of microbial communities now makes up the majority of public genomic information. The assignment of a function to sequences from these metagenomic sources is challenging because organisms associated with the data are often uncharacterized and not cultivable. To overcome these challenges, we created a rationally designed expression library of metagenomic proteins covering the sequence space of the thioredoxin superfamily. This library of 100 individual proteins represents more than 22,000 thioredoxins found in the Global Ocean Sampling data set. We screened this library for the functional rescue of Escherichia coli mutants lacking the thioredoxin-type reductase (ΔtrxA), isomerase (ΔdsbC), or oxidase (ΔdsbA). We were able to assign functions to more than a quarter of our representative proteins. The in vivo function of a given representative could not be predicted by phylogenetic relation but did correlate with the predicted isoelectric surface potential of the protein. Selected proteins were then purified, and we determined their activity using a standard insulin reduction assay and measured their redox potential. An unexpected gel shift of protein E5 during the redox potential determination revealed a redox cycle distinct from that of typical thioredoxin-superfamily oxidoreductases. Instead of the intramolecular disulfide bond formation typical for thioredoxins, this protein forms an intermolecular disulfide between the attacking cysteines of two separate subunits during its catalytic cycle. Our functional metagenomic approach proved not only useful to assign in vivo functions to representatives of thousands of proteins but also uncovered a novel reaction mechanism in a seemingly well-known protein superfamily.

Highlights

  • From the free thiol (SH) form to a disulfide bond (S–S) and/or back and donate or accept electrons

  • Categorization of thioredoxin superfamily members from a metagenomic data set In order to advance our understanding of the structure– function relationship of the thioredoxin superfamily, we wanted to categorize all thioredoxin superfamily members found within the Global Ocean Sampling (GOS) data set into the three major groups: reductases, oxidases, and isomerases

  • We selected all sequences from a nonredundant GOS data set that matched to any of the hidden Markov models of the Pfam thioredoxin clan (CL0172) with a score below 10−5 [11]

Read more

Summary

Functional metagenomics of the thioredoxin superfamily

Leichert1,* From the 1Institute of Biochemistry and Pathobiochemistry – Microbial Biochemistry, Ruhr-Universität Bochum, Bochum, Germany; 2Protein Expression and Modification Division, New England Biolabs, Ipswich, Massachusetts, USA; 3Institute for Medical Biochemistry and Molecular Biology, Universität Greifswald, Greifswald, Germany; 4Medizinisches Proteom-Center, RuhrUniversität Bochum, Bochum, Germany

Edited by Ruma Banerjee
Results and discussion
Functional complementation assays
Reductase assay
Oxidase assay
Isomerase assay
Oxidoreductase activity of metagenomic thioredoxins
The metagenomic thioredoxins span a wide range of redox potentials
Conclusion
Definition of thioredoxin superfamily members from the GOS data set
Selection of representative sequences
Strain construction and strain maintenance
Molecular cloning
Secondary structure prediction and isosurface calculation
Protein purification
Insulin reduction assay
Redox potential determination
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call