GOPred: GO Molecular Function Prediction by Combined Classifiers

Ömer Sinan Saraç,Rengul Cetin-Atalay,Volkan Atalay

doi:10.1371/journal.pone.0012382


Open Access

https://doi.org/10.1371/journal.pone.0012382


Journal: PLoS ONE	Publication Date: Aug 31, 2010
Citations: 40	License type: CC BY 4.0

Affiliation: Middle East Technical University

Abstract

Functional protein annotation is an important matter for in vivo and in silico biology. Several computational methods have been proposed that make use of a wide range of features such as motifs, domains, homology, structure and physicochemical properties. No single method performs best in all functional classification problems, because the information obtained from any of these features depends on the function to be assigned to the protein. In this study, we present a novel approach that combines different methods to better represent protein function. First, we formulated the function annotation problem as a classification problem defined on 300 different Gene Ontology (GO) terms from the molecular function aspect. We presented a method to form positive and negative training examples while taking into account the directed acyclic graph (DAG) structure and evidence codes of GO. We applied three different methods and their combinations. Results show that combining different methods improves prediction accuracy in most cases. The proposed method, GOPred, is available as an online computational annotation tool (http://kinaz.fen.bilkent.edu.tr/gopred).
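
As a rough illustration of what combining methods can mean, the sketch below averages per-term scores from several classifiers, optionally with weights. The abstract does not state which combination rule GOPred uses, so the weighted mean here is an assumption, and the function name `combine_scores`, the method labels, and all score values are hypothetical.

```python
def combine_scores(scores_by_method, weights=None):
    """Combine per-term scores from several classifiers by a weighted mean.

    scores_by_method: dict mapping method name -> {GO term -> score in [0, 1]}.
    weights: optional dict mapping method name -> non-negative weight
             (assumed; equal weights are used when omitted).
    Returns a dict mapping GO term -> combined score.
    """
    methods = sorted(scores_by_method)
    if weights is None:
        weights = {m: 1.0 for m in methods}
    total = sum(weights[m] for m in methods)
    # Collect every term scored by at least one method.
    terms = set().union(*(scores_by_method[m].keys() for m in methods))
    combined = {}
    for term in terms:
        # A method that did not score a term contributes 0.0 (an assumption).
        weighted = sum(
            weights[m] * scores_by_method[m].get(term, 0.0) for m in methods
        )
        combined[term] = weighted / total
    return combined


# Hypothetical scores from three unnamed methods for two placeholder terms.
example_scores = {
    "method_a": {"GO:X": 0.9, "GO:Y": 0.2},
    "method_b": {"GO:X": 0.5, "GO:Y": 0.4},
    "method_c": {"GO:X": 0.7, "GO:Y": 0.0},
}
mean_scores = combine_scores(example_scores)
weighted_scores = combine_scores(
    example_scores, weights={"method_a": 2.0, "method_b": 1.0, "method_c": 1.0}
)
```

Other combination rules, such as majority voting or taking the maximum score, would fit the same interface; which rule works best is an empirical question for each GO term.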

Highlights

Due to advances in genome sequencing techniques during the last decade, the number of proteins being identified is exponentially increasing
We developed a method to prepare training data for the terms defined in Gene Ontology (GO) framework
We present a way of establishing positive and negative training data for each class based on evidence codes provided by the GO annotation (GOA) project and by considering the structure of the GO directed acyclic graph (DAG)
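
The idea of deriving training sets from evidence codes and the DAG can be sketched as follows. This is a minimal sketch under stated assumptions, not the authors' procedure: the toy DAG, the set of trusted evidence codes, and the rule that proteins annotated only with an ancestor of the target term are left out as ambiguous are all illustrative choices that the text above does not specify.

```python
from collections import defaultdict

# Hypothetical toy DAG: child term -> set of parent terms.
PARENTS = {
    "GO:B": {"GO:A"},
    "GO:C": {"GO:A"},
    "GO:D": {"GO:B", "GO:C"},
}

# Assumed set of trusted evidence codes (real GO codes, but the selection
# is an assumption; IEA, for example, is treated as untrusted here).
TRUSTED_CODES = {"EXP", "IDA", "IPI", "IMP", "IGI", "IEP", "TAS"}


def ancestors(term, parents):
    """Return all ancestors of `term` in the DAG, excluding the term itself."""
    seen = set()
    stack = list(parents.get(term, ()))
    while stack:
        current = stack.pop()
        if current not in seen:
            seen.add(current)
            stack.extend(parents.get(current, ()))
    return seen


def build_training_sets(target, annotations, parents, trusted=TRUSTED_CODES):
    """Split proteins into positive and negative examples for `target`.

    annotations: iterable of (protein, term, evidence_code) tuples.
    """
    target_ancestors = ancestors(target, parents)
    trusted_terms = defaultdict(set)
    for protein, term, code in annotations:
        if code in trusted:  # annotations with untrusted codes are ignored
            trusted_terms[protein].add(term)

    positives, negatives = set(), set()
    for protein, terms in trusted_terms.items():
        if any(t == target or target in ancestors(t, parents) for t in terms):
            # Annotated with the target or one of its descendants.
            positives.add(protein)
        elif any(t in target_ancestors for t in terms):
            # Annotated only with a more general term: ambiguous, so skipped.
            continue
        else:
            negatives.add(protein)
    return positives, negatives


toy_annotations = [
    ("P1", "GO:B", "IDA"),
    ("P2", "GO:D", "EXP"),
    ("P3", "GO:C", "IMP"),
    ("P4", "GO:A", "TAS"),
    ("P5", "GO:B", "IEA"),
]
pos, neg = build_training_sets("GO:B", toy_annotations, PARENTS)
```

In this toy run, P1 and P2 become positives for `GO:B`, P3 becomes a negative, P4 is skipped as ambiguous, and P5 is dropped because its only annotation carries an untrusted code.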

Summary

Introduction

Due to advances in genome sequencing techniques during the last decade, the number of proteins being identified is increasing exponentially. Functional annotation of proteins has become one of the central problems in molecular biology. Annotations of the highest-scoring hits, according to a similarity calculation, are transferred onto the target protein. This track can be called the transfer approach. It has known drawbacks, such as excessive transferring of annotations, low sensitivity, low specificity, and propagation of database errors. Even so, this track is the most widely used among biologists: it is historically the first successful method, although it was developed when the number of protein sequences in the databases was much lower than today's [1,2,3,4,5,6], and it is well understood by experimentalists.
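
The transfer approach described above can be sketched in a few lines. This is an illustrative sketch only: the function name `transfer_annotations`, the `top_k` and `min_score` parameters, and the example scores are hypothetical, and the similarity scores are assumed to come from some external search tool.

```python
def transfer_annotations(hits, annotations, top_k=1, min_score=0.0):
    """Transfer annotations from the highest-scoring hits to a target protein.

    hits: list of (protein_id, similarity_score) pairs for the target.
    annotations: dict mapping protein_id -> set of annotated terms.
    top_k: number of best hits to copy annotations from (assumed parameter).
    min_score: hits scoring below this threshold are ignored (assumed parameter).
    """
    ranked = sorted(hits, key=lambda hit: hit[1], reverse=True)
    transferred = set()
    for protein_id, score in ranked[:top_k]:
        if score >= min_score:
            transferred |= annotations.get(protein_id, set())
    return transferred


# Hypothetical hits and annotations for one target protein.
example_hits = [("Q1", 52.0), ("Q2", 310.5), ("Q3", 120.0)]
example_annotations = {
    "Q1": {"GO:X"},
    "Q2": {"GO:Y", "GO:Z"},
    "Q3": {"GO:X"},
}
best_hit_terms = transfer_annotations(example_hits, example_annotations)
top_two_terms = transfer_annotations(example_hits, example_annotations, top_k=2)
```

The sketch also makes the drawbacks easy to see: raising `top_k` or lowering `min_score` transfers more annotations at the cost of specificity, and any error in `example_annotations` is copied to the target unchanged.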



