Experimental and computational investigation of enzyme functional annotations uncovers misannotation in the EC 1.1.3.15 enzyme class.

Elzbieta Rembeza,Martin K M Engqvist

doi:10.1371/journal.pcbi.1009446

Abstract

Only a small fraction of genes deposited to databases have been experimentally characterised. The majority of proteins have their function assigned automatically, which can result in erroneous annotations. The reliability of current annotations in public databases is largely unknown; experimental attempts to validate the accuracy within individual enzyme classes are lacking. In this study we performed an overview of functional annotations to the BRENDA enzyme database. We first applied a high-throughput experimental platform to verify functional annotations to an enzyme class of S-2-hydroxyacid oxidases (EC 1.1.3.15). We chose 122 representative sequences of the class and screened them for their predicted function. Based on the experimental results, predicted domain architecture and similarity to previously characterised S-2-hydroxyacid oxidases, we inferred that at least 78% of sequences in the enzyme class are misannotated. We experimentally confirmed four alternative activities among the misannotated sequences and showed that misannotation in the enzyme class increased over time. Finally, we performed a computational analysis of annotations to all enzyme classes in the BRENDA database, and showed that nearly 18% of all sequences are annotated to an enzyme class while sharing no similarity or domain architecture to experimentally characterised representatives. We showed that even well-studied enzyme classes of industrial relevance are affected by the problem of functional misannotation.

Highlights

With the steady increase of genetic information deposited to public databases, the proportion of experimentally characterised sequences continues to decline
Correct annotation of genomes is crucial for our understanding and utilization of functional gene diversity, yet the reliability of current protein annotations in public databases is largely unknown
We showed that the misannotation is widespread throughout enzyme classes, affecting even well-studied classes of industrial relevance

Summary

Introduction

With the steady increase of genetic information deposited to public databases, the proportion of experimentally characterised sequences continues to decline. As the traditional experimental methods for determining protein function cannot keep up with the increase in genomic data, high-throughput methods enabling protein family-wide substrate profiling for hundreds of enzymes are being implemented. Data generated in such approaches are important for understanding sequencefunction relationships in the tested protein families; they have led to the discovery of novel enzymatic activities as well as identified enzymes with diverse physicochemical properties [2,3,4,5,6]. Several global initiatives have been undertaken to bring together computational and experimental scientists to accelerate discovery of novel protein activities and enable more trustworthy functional annotations [7,8,9]

Methods

Results

Discussion

Conclusion

Full Text

Published Version (Free)

View/Download pdf

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: PLOS Computational Biology	Publication Date: Sep 23, 2021
Citations: 26	License type: CC BY 4.0

R Discovery Prime

Experimental and computational investigation of enzyme functional annotations uncovers misannotation in the EC 1.1.3.15 enzyme class.

Abstract

Highlights

Summary

Published Version (Free)

Talk to us

Similar Papers

More From: PLOS Computational Biology

Lead the way for us

Similar Papers

Experimental and computational investigation of enzyme functional annotations uncovers misannotation in the EC 1.1.3.15 enzyme class
Martin K M Engqvist ... Marco Punta
-
Martin K M Engqvist, et. al.Martin K M Engqvist ... Marco Punta
23 Sep 2021
23 Sep 2021

A Novel Method for Expanding Current Annotations in Gene Ontology
Dapeng Hao ... Shaoqi Rao
-
Dapeng Hao, et. al.Dapeng Hao ... Shaoqi Rao
01 Jan 2006
01 Jan 2006

Analysis of the genomic basis of functional diversity in dinoflagellates using a transcriptome-based sequence similarity network.
Arnaud Meng ... Stéphane Le Crom
Molecular Ecology | VOL. 27
Arnaud Meng, et. al.Arnaud Meng ... Stéphane Le Crom
01 May 2018
Molecular Ecology | VOL. 27

Genome-scale identification and characterization of moonlighting proteins.
Ishita Khan ... Xioawei Hong
Biology direct | VOL. 9
Ishita Khan, et. al.Ishita Khan ... Xioawei Hong
01 Dec 2014
Biology direct | VOL. 9

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

Experimental and computational investigation of enzyme functional annotations uncovers misannotation in the EC 1.1.3.15 enzyme class.

Abstract

Highlights

Summary

Published Version (Free)

Talk to us

Similar Papers

More From: PLOS Computational Biology