Towards structured output prediction of enzyme function.

Katja Astikainen, Sandor Szedmak, Juho Rousu, Liisa Holm, Esa Pitkänen

doi:10.1186/1753-6561-2-s4-s2

Open Access

https://doi.org/10.1186/1753-6561-2-s4-s2

Journal: BMC Proceedings	Publication Date: Dec 1, 2008
Citations: 68	License type: cc-by

Affiliation: University of Helsinki

Abstract

In this paper we describe work in progress in developing kernel methods for enzyme function prediction. Our focus is on developing so-called structured output prediction methods, where the enzymatic reaction is the combinatorial target object for prediction. We compared two structured output prediction methods, the Hierarchical Max-Margin Markov algorithm (HM3) and the Maximum Margin Regression algorithm (MMR), in hierarchical classification of enzyme function. As sequence features we use various string kernels and the GTG feature set derived from the global alignment trace graph of protein sequences. In our experiments on predicting the enzyme EC classification, we obtain over 85% accuracy (predicting the four-digit EC code) and over 91% microlabel F1 score (predicting individual EC digits). In predicting the Gold Standard enzyme families, we obtain over 79% accuracy (predicting the family correctly) and over 89% microlabel F1 score (predicting superfamilies and families). In the latter case, structured output methods are significantly more accurate than the nearest neighbor classifier. A polynomial kernel over the GTG feature set turned out to be a prerequisite for accurate function prediction. Combining GTG with string kernels boosted accuracy slightly in the case of EC class prediction. Structured output prediction with GTG features is shown to be computationally feasible and to have accuracy on par with state-of-the-art approaches in enzyme function prediction.
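The two evaluation measures quoted in the abstract can be illustrated with a small sketch. It assumes that a "microlabel" is a node on the path from the root of the EC hierarchy to the full code (for example `1`, `1.1`, `1.1.1`, `1.1.1.1`) and that microlabel F1 is micro-averaged over all such nodes; the abstract does not spell out the exact definition, so this is one plausible reading, and the function names and example codes are hypothetical.

```python
def ec_prefixes(code):
    """Return the set of hierarchy nodes on the path to an EC code."""
    digits = code.split(".")
    return {".".join(digits[:i]) for i in range(1, len(digits) + 1)}


def exact_accuracy(true_codes, pred_codes):
    """Fraction of examples whose full EC code is predicted exactly."""
    hits = sum(t == p for t, p in zip(true_codes, pred_codes))
    return hits / len(true_codes)


def microlabel_f1(true_codes, pred_codes):
    """Micro-averaged F1 over hierarchy nodes (assumed microlabel definition)."""
    tp = fp = fn = 0
    for t, p in zip(true_codes, pred_codes):
        t_nodes, p_nodes = ec_prefixes(t), ec_prefixes(p)
        tp += len(t_nodes & p_nodes)   # nodes predicted and correct
        fp += len(p_nodes - t_nodes)   # nodes predicted but wrong
        fn += len(t_nodes - p_nodes)   # true nodes that were missed
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Illustrative data: the second prediction is wrong only in the last digit.
true_codes = ["1.1.1.1", "2.7.1.1"]
pred_codes = ["1.1.1.1", "2.7.1.2"]
acc = exact_accuracy(true_codes, pred_codes)
f1 = microlabel_f1(true_codes, pred_codes)
```

Under this reading, a prediction that misses only the last digit counts as a full error for accuracy but still earns credit for the three correct upper levels, which is consistent with the microlabel F1 figures in the abstract being higher than the accuracy figures.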

Highlights

In this paper we describe work in progress in developing kernel methods for enzyme function prediction
Results in EC class prediction: we report on experiments in predicting the EC hierarchy with the Maximum Margin Regression algorithm (MMR) and HM3 using different sequence kernel combinations, with a polynomial kernel applied on top
Our preliminary experiments indicated that the GTG kernel is the only single kernel reaching a microlabel F1 above 80%
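A minimal sketch of what "a polynomial kernel applied on top" of a kernel combination could look like is given here. The highlights do not state how the base kernels are combined, so the unweighted sum, the cosine normalization, and the `degree` and `offset` values are all assumptions made for illustration, and the function names are hypothetical.

```python
import math


def normalize(K):
    """Cosine-normalize a kernel matrix so that every diagonal entry is 1."""
    n = len(K)
    return [[K[i][j] / math.sqrt(K[i][i] * K[j][j]) for j in range(n)]
            for i in range(n)]


def combine(kernels):
    """Combine base kernel matrices by an unweighted sum (assumed scheme)."""
    n = len(kernels[0])
    return [[sum(K[i][j] for K in kernels) for j in range(n)]
            for i in range(n)]


def polynomial_on_top(K, degree=2, offset=1.0):
    """Apply k -> (k + offset) ** degree elementwise to a kernel matrix."""
    return [[(k + offset) ** degree for k in row] for row in K]


# Toy 2x2 kernel matrices standing in for, e.g., a GTG kernel and a string kernel.
K_gtg = [[4.0, 2.0], [2.0, 4.0]]
K_string = [[1.0, 0.0], [0.0, 1.0]]

K_sum = combine([normalize(K_gtg), normalize(K_string)])
K_poly = polynomial_on_top(K_sum, degree=2, offset=1.0)
```

With a non-negative offset and a positive integer degree, the elementwise polynomial of a positive semidefinite kernel matrix is again positive semidefinite, so the result can be passed to any kernel method as a precomputed kernel.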

Summary

Introduction

In this paper we describe work in progress in developing kernel methods for enzyme function prediction. We compared two structured output prediction methods in hierarchical classification of enzyme function: the Hierarchical Max-Margin Markov algorithm, HM3 [11], and Maximum Margin Regression, MMR [12]. The former is a method designed for hierarchical multilabel classification; the latter can be seen as a generalization of the one-class support vector machine to structured output domains. [...] metabolic reconstruction and the analysis of metabolic fluxes [1]. Protein function taxonomies such as Gene Ontology [2] and MIPS CYGD [3] classify proteins according to many aspects, only one of them being the exact function (the biochemical reaction catalyzed). Cai et al. [6] predict membership in enzyme families one family at a time with support vector machines.
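To make the contrast with one-family-at-a-time prediction concrete, the following sketch encodes an EC code as a single hierarchical multilabel target: one +1/-1 entry per node of the hierarchy, with the nodes on the path to the code switched on. The +1/-1 convention, the node ordering, and all function names are assumptions for illustration and are not taken from the paper.

```python
def path_nodes(code):
    """List the hierarchy nodes from the top level down to the full code."""
    parts = code.split(".")
    return [".".join(parts[:i]) for i in range(1, len(parts) + 1)]


def build_node_index(codes):
    """Map every node appearing in the given codes to a vector position."""
    nodes = sorted({n for c in codes for n in path_nodes(c)})
    return {node: i for i, node in enumerate(nodes)}


def encode(code, node_index):
    """Encode a code as a +1/-1 vector over all hierarchy nodes."""
    vec = [-1] * len(node_index)
    for n in path_nodes(code):
        vec[node_index[n]] = 1
    return vec


def is_consistent(vec, node_index):
    """Check that every switched-on node also has its parent switched on."""
    for node, i in node_index.items():
        if vec[i] == 1 and "." in node:
            parent = node.rsplit(".", 1)[0]
            if vec[node_index[parent]] != 1:
                return False
    return True


# Illustrative label set with two codes that share their first three digits.
codes = ["1.1.1.1", "1.1.1.2"]
node_index = build_node_index(codes)
target = encode("1.1.1.2", node_index)
```

A flat, one-family-at-a-time classifier would treat each leaf as an unrelated binary problem, whereas a structured output method predicts the whole vector at once and can restrict itself to vectors for which `is_consistent` holds.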
