Supervised Learning-Aided Optimization of Expert-Driven Functional Protein Sequence Annotation

Lev Soinov,Alexander Kanapin,Misha Kapushesky

doi:10.1007/978-3-540-30219-3_14

Abstract

The aim of this work is to use a supervised learning approach to identify sets of motif-based sequence characteristics, combinations of which can give the most accurate annotation of new proteins. We assess several of InterPro Consortium member databases for their informativeness for the annotation of full-length protein sequences. Thus, our study addresses the problem of integrating biological information from various resources. Decision-rule algorithms are used to cross-map different biological classification systems in order to optimise the process of functional annotation of protein sequences. Various features (e.g., keywords, GO terms, structural complex names) may be assigned to a sequence via its characteristics (e.g., motifs built by various protein sequence analysis methods) with the developed approach. We chose SwissProt keywords as the set of features on which to perform our analysis. From the presented results one can quickly obtain the best combinations of methods appropriate for the description of a given class of proteins.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Supervised Learning-Aided Optimization of Expert-Driven Functional Protein Sequence Annotation

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Large-scale automated function prediction of protein sequences and an experimental case study validation on PTEN transcript variants.
Ahmet Sureyya Rifaioglu ... Ömer Sinan Saraç
Proteins: Structure, Function, and Bioinformatics | VOL. 86
Ahmet Sureyya Rifaioglu, et. al.Ahmet Sureyya Rifaioglu ... Ömer Sinan Saraç
29 Nov 2017
Proteins: Structure, Function, and Bioinformatics | VOL. 86

Automatic Generation of Functional Annotation Rules Using Inferred GO-Domain Associations
...
-
, et. al. ...
08 Aug 2017
08 Aug 2017

Exploiting Complex Protein Domain Networks for Protein Function Annotation
Bishnu Sarker ... David W Rtichie
-
Bishnu Sarker, et. al.Bishnu Sarker ... David W Rtichie
05 Dec 2018
05 Dec 2018

HAMAP in 2015: updates to the protein family classification and annotation system.
Ivo Pedruzzi ... Edouard De Castro
Nucleic Acids Research | VOL. 43
Ivo Pedruzzi, et. al.Ivo Pedruzzi ... Edouard De Castro
27 Oct 2014
HAMAP in 2015: updates to the protein family classification and annotation system.
Ivo Pedruzzi ... Edouard De Castro

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Supervised Learning-Aided Optimization of Expert-Driven Functional Protein Sequence Annotation

Abstract

Talk to us

Similar Papers