GeneHunt for rapid domain-specific annotation of glycoside hydrolases

S N Nguyen, R Berlemont, J Schwans, D Talamantes, A Flores, F Dar, A Valdez

doi:10.1038/s41598-019-46290-w


Open Access

https://doi.org/10.1038/s41598-019-46290-w


Journal: Scientific Reports	Publication Date: Jul 12, 2019
Citations: 19	License type: open-access

Affiliation: California State University, Long Beach

Abstract

The identification of glycoside hydrolases (GHs) for efficient polysaccharide deconstruction is essential for the development of biofuels. Here, we investigate the potential of sequential HMM-profile identification for the rapid and precise identification of the multi-domain architecture of GHs from various datasets. First, as a validation, we successfully reannotated >98% of the biochemically characterized enzymes listed on the CAZy database. Next, we analyzed the 43 million non-redundant sequences from the M5nr data and identified 322,068 unique GHs. Finally, we searched 129 assembled metagenomes retrieved from MG-RAST for environmental GHs and identified 160,790 additional enzymes. Although most identified sequences corresponded to single-domain enzymes, many contained several domains, including known accessory domains and some domains never identified in association with GHs. Several sequences displayed multiple catalytic domains, and a few of these potential multi-activity proteins combined potentially synergistic domains. Finally, we produced and confirmed the biochemical activities of a GH5-GH10 cellulase-xylanase and a GH11-CE4 xylanase-esterase. Globally, this “gene to enzyme pipeline” provides a rationale for mining large datasets in order to identify new catalysts combining unique properties for the efficient deconstruction of polysaccharides.
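The architecture categories mentioned in the abstract (single-domain enzymes, catalytic domains with accessory domains, and proteins with multiple catalytic domains) can be illustrated with a minimal sketch. It is not the authors' code. It assumes that each protein has already been reduced to a dash-separated string of simplified family labels such as "GH5-GH10", and that any label starting with "GH" or "CE" is catalytic. The function name, the prefix list, and the "CBM2" accessory example are illustrative assumptions.

```python
from collections import Counter

# Assumption: labels starting with these prefixes denote catalytic domains.
CATALYTIC_PREFIXES = ("GH", "CE")


def classify_architecture(architecture: str) -> str:
    """Assign a dash-separated domain string to a coarse category."""
    domains = [d for d in architecture.split("-") if d]
    catalytic = [d for d in domains if d.startswith(CATALYTIC_PREFIXES)]
    if not catalytic:
        return "no catalytic domain"
    if len(domains) == 1:
        return "single-domain"
    if len(catalytic) > 1:
        return "multi-catalytic"
    return "catalytic + accessory"


# Hypothetical inputs; GH5-GH10 and GH11-CE4 are the two architectures
# named in the abstract, the others are made up for illustration.
architectures = ["GH5", "GH5-GH10", "GH11-CE4", "GH5-CBM2"]
summary = Counter(classify_architecture(a) for a in architectures)
for category, count in sorted(summary.items()):
    print(f"{category}: {count}")
```

A tally of this kind is one way to summarize how many identified sequences fall into each category once the domain architectures are known.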

Highlights

The identification of glycoside hydrolases (GHs) for efficient polysaccharide deconstruction is essential for the development of biofuels.
In order to identify new catalysts for biomass degradation, we examined the performance of sequential Hidden Markov Model (HMM) identifications[28] combined with publicly accessible HMM-profiles from the PFam database[29], here referred to as the GeneHunt approach[2,30], to detect GH-sequences and investigate their detailed architecture[2].
In order to evaluate the GeneHunt approach, we annotated the sequences of biochemically characterized GHs listed on the Carbohydrate-Active Enzymes (CAZy) database.

Summary

Introduction

The identification of glycoside hydrolases (GHs) for efficient polysaccharide deconstruction is essential for the development of biofuels. We produced and confirmed the biochemical activities of a GH5-GH10 cellulase-xylanase and a GH11-CE4 xylanase-esterase. This “gene to enzyme pipeline” provides a rationale for mining large datasets in order to identify new catalysts combining unique properties for the efficient deconstruction of polysaccharides. In order to identify new catalysts for biomass degradation, we examined the performance of sequential Hidden Markov Model (HMM) identifications[28] combined with publicly accessible HMM-profiles from the PFam database[29], here referred to as the GeneHunt approach[2,30], to detect GH-sequences and investigate their detailed architecture (i.e., the precise domain organization of multi-domain GHs, MDGHs)[2]. We identified the detailed multi-domain architecture of GH proteins in assembled, publicly accessible metagenomes.
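To make "detailed architecture" concrete, the following is a minimal sketch of how a set of profile hits on one protein could be reduced to an ordered domain string. It is not the GeneHunt implementation. It assumes the hits have already been obtained (for example, parsed from the output of an HMM search) and uses a simple greedy rule: keep the highest-scoring hit, discard any weaker hit that overlaps it, and repeat. The `DomainHit` type, the function names, the simplified family labels, and all coordinates and scores are hypothetical.

```python
from typing import List, NamedTuple


class DomainHit(NamedTuple):
    name: str     # simplified domain label, e.g. "GH5" (assumption)
    start: int    # 1-based start position on the protein
    end: int      # inclusive end position on the protein
    score: float  # bit score of the profile match


def overlaps(a: DomainHit, b: DomainHit, max_overlap: int = 0) -> bool:
    """True if the two hits share more than max_overlap residues."""
    shared = min(a.end, b.end) - max(a.start, b.start) + 1
    return shared > max_overlap


def resolve_architecture(hits: List[DomainHit], max_overlap: int = 0) -> List[DomainHit]:
    """Greedily keep the best-scoring, mutually non-overlapping hits."""
    chosen: List[DomainHit] = []
    for hit in sorted(hits, key=lambda h: h.score, reverse=True):
        if not any(overlaps(hit, kept, max_overlap) for kept in chosen):
            chosen.append(hit)
    # Report domains in N- to C-terminal order.
    return sorted(chosen, key=lambda h: h.start)


def architecture_string(hits: List[DomainHit]) -> str:
    """Join the resolved domain names into a string such as 'GH5-GH10'."""
    return "-".join(h.name for h in resolve_architecture(hits))


# Hypothetical hits on a single protein.
example_hits = [
    DomainHit("GH5", 40, 330, 250.0),
    DomainHit("GH10", 380, 700, 310.5),
    DomainHit("GH26", 60, 300, 35.2),  # weaker hit overlapping GH5; dropped
]
print(architecture_string(example_hits))  # GH5-GH10
```

The `max_overlap` parameter is a design choice for this sketch: setting it above zero tolerates small boundary overlaps between adjacent domains instead of discarding one of them.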



