HydDB: A web tool for hydrogenase classification and analysis.

Dan Søndergaard, Chris Greening, Christian N S Pedersen

doi:10.1038/srep34212

Abstract

H2 metabolism is proposed to be the most ancient and diverse mechanism of energy conservation. The metalloenzymes mediating this metabolism, hydrogenases, are encoded by over 60 microbial phyla and are present in all major ecosystems. We developed a classification system and web tool, HydDB, for the structural and functional analysis of these enzymes. We show that hydrogenase function can be predicted from primary sequence alone using an expanded classification scheme (comprising 29 [NiFe], 8 [FeFe], and 1 [Fe] hydrogenase classes) that defines 11 new classes with distinct biological functions. Using this scheme, we built a web tool that rapidly and reliably classifies hydrogenase primary sequences using a combination of k-nearest neighbors algorithms and CDD referencing. Demonstrating its capacity, the tool reliably predicted hydrogenase content and function in 12 newly sequenced bacteria, archaea, and eukaryotes. HydDB allows users to browse the amino acid sequences of 3248 annotated hydrogenase catalytic subunits and also contains a detailed repository of physiological, biochemical, and structural information about the 38 hydrogenase classes defined here. The database and classifier are freely and publicly available at http://services.birc.au.dk/hyddb/

Highlights

In this work, we build on these findings to develop the first web database for the classification and analysis of hydrogenases.
We initially developed a classification scheme to enable prediction of hydrogenase function from primary sequence alone.
We visualized the relationships between all hydrogenases in sequence similarity networks (SSN) [18], in which nodes represent individual proteins and the distances between them reflect BLAST E-values.
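
To make the sequence similarity network idea concrete, the following is a minimal sketch of how such a network could be assembled from pairwise BLAST hits. It is not the authors' pipeline: the function names (build_ssn, find_clusters), the E-value cutoff of 1e-5, the -log10(E-value) edge score, and the toy hits are all assumptions chosen for illustration.

```python
import math


def build_ssn(hits, max_evalue=1e-5):
    """Build an undirected similarity network from pairwise hits.

    hits: iterable of (query_id, subject_id, evalue) tuples, for example
    parsed from BLAST tabular output (hypothetical input format).
    Returns {node: {neighbor: score}} with score = -log10(evalue),
    so that more significant hits receive larger scores.
    """
    graph = {}
    for query, subject, evalue in hits:
        # Register both proteins as nodes even if no edge survives the cutoff.
        graph.setdefault(query, {})
        graph.setdefault(subject, {})
        if query == subject or evalue > max_evalue:
            continue
        # Clamp the E-value so that a reported 0.0 does not break the logarithm.
        score = -math.log10(max(evalue, 1e-300))
        # Keep the strongest score if a pair is reported more than once.
        best = max(score, graph[query].get(subject, 0.0))
        graph[query][subject] = best
        graph[subject][query] = best
    return graph


def find_clusters(graph):
    """Return the connected components of the network as sets of node ids."""
    seen = set()
    clusters = []
    for start in graph:
        if start in seen:
            continue
        component = set()
        stack = [start]
        while stack:
            node = stack.pop()
            if node in seen:
                continue
            seen.add(node)
            component.add(node)
            stack.extend(graph[node])
        clusters.append(component)
    return clusters


if __name__ == "__main__":
    # Toy hits with made-up protein ids and E-values (assumed, not real data).
    toy_hits = [
        ("A", "B", 1e-50),
        ("B", "C", 1e-30),
        ("C", "D", 1e-2),   # too weak: dropped by the cutoff
        ("D", "E", 1e-40),
    ]
    network = build_ssn(toy_hits)
    for cluster in find_clusters(network):
        print(sorted(cluster))
```

Tightening the cutoff splits the network into smaller, more closely related clusters, while loosening it merges them. A sweep over cutoffs is therefore one way to explore how sequence groups separate.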

Summary

Introduction

We build on these findings to develop the first web database for the classification and analysis of hydrogenases. We developed an expanded classification scheme that captures the full sequence diversity of hydrogenase enzymes and predicts their biological function. Using this information, we developed a classification tool based on the k-nearest neighbors (k-NN) method. HydDB is a user-friendly, high-throughput, and functionally predictive tool for hydrogenase classification that operates with precision exceeding 99.8%.
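
As an illustration of the k-NN principle applied to protein sequences, the sketch below assigns a query sequence to the majority class among its k closest labeled reference sequences. The feature representation and distance measure used by HydDB are not described in this section, so the choices here (3-mer sets compared by Jaccard distance), the function names, the toy sequences, and the labels class_A and class_B are all assumptions made for illustration.

```python
from collections import Counter


def kmer_set(sequence, size=3):
    """Return the set of overlapping substrings of length `size`."""
    return {sequence[i:i + size] for i in range(len(sequence) - size + 1)}


def jaccard_distance(a, b):
    """Jaccard distance between two sets: 0.0 if identical, 1.0 if disjoint."""
    union = a | b
    if not union:
        return 0.0
    return 1.0 - len(a & b) / len(union)


def knn_classify(query, references, k=3, size=3):
    """Classify `query` by majority vote among its k nearest references.

    references: list of (sequence, class_label) pairs with known classes.
    Ties in the vote are resolved in favor of the label seen first among
    the nearest neighbors.
    """
    query_kmers = kmer_set(query, size)
    ranked = sorted(
        references,
        key=lambda ref: jaccard_distance(query_kmers, kmer_set(ref[0], size)),
    )
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]


if __name__ == "__main__":
    # Toy reference set with made-up sequences and labels (not real hydrogenases).
    toy_references = [
        ("MKVLAAGIVGLS", "class_A"),
        ("MKVLAAGIVGLT", "class_A"),
        ("MKVLSAGIVGLS", "class_A"),
        ("QQWERTYPLKHG", "class_B"),
        ("QQWERTYPLKHA", "class_B"),
    ]
    print(knn_classify("MKVLAAGIVGLA", toy_references, k=3))
```

A small odd k keeps the vote decisive when the reference classes are well separated. With unevenly sized classes, a k larger than the smallest class can let a bigger neighboring class outvote it.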

Methods

Results

Conclusion