A Kernel Framework for Protein Residue Annotation

Huzefa Rangwala, George Karypis, Christopher Kauffman

doi:10.1007/978-3-642-01307-2_40

Abstract

Over the last decade, several prediction methods have been developed for determining structural and functional properties of individual protein residues using sequence and sequence-derived information. Most of these methods are based on support vector machines, as they provide accurate and generalizable prediction models. We developed a general-purpose protein residue annotation toolkit (ProSAT) to allow biologists to formulate residue-wise prediction problems. ProSAT formulates the annotation problem as a classification or regression problem using support vector machines. For every residue, ProSAT captures local information (any sequence-derived information) around the residue to create fixed-length feature vectors. ProSAT implements accurate and fast kernel functions, and also introduces a flexible window-based encoding scheme that allows better capture of signals for certain prediction problems. In this work we evaluate the performance of ProSAT on the disorder prediction and contact order estimation problems, studying the effect of the different kernels introduced here. ProSAT shows better or at least comparable performance to state-of-the-art prediction systems. In particular, ProSAT has proven to be the best performing transmembrane-helix predictor on an independent blind benchmark. Availability: http://bio.dtc.umn.edu/prosat
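The window-based encoding described in the abstract can be illustrated with a minimal Python sketch. It is not ProSAT's implementation or API: the function names (`one_hot`, `window_features`), the one-hot amino acid features, the half-width parameter `w`, and the zero-padding at the sequence termini are all assumptions chosen for illustration. The sketch shows only how local information around each residue can be concatenated into a fixed-length vector of the kind that could then be passed to a support vector machine.

```python
from typing import List, Sequence

# The 20 standard amino acids; any other symbol maps to an all-zero vector.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def one_hot(residue: str) -> List[float]:
    """Encode one residue as a 20-dimensional indicator vector (assumed feature)."""
    return [1.0 if residue == aa else 0.0 for aa in AMINO_ACIDS]


def window_features(per_residue: Sequence[Sequence[float]], w: int) -> List[List[float]]:
    """For each position i, concatenate the features of positions i-w .. i+w.

    Positions that fall outside the sequence are zero-padded (an assumption),
    so every residue gets a vector of length (2*w + 1) * d.
    """
    if w < 0:
        raise ValueError("window half-width w must be non-negative")
    n = len(per_residue)
    d = len(per_residue[0]) if n else 0
    pad = [0.0] * d
    encoded = []
    for i in range(n):
        vec: List[float] = []
        for j in range(i - w, i + w + 1):
            vec.extend(per_residue[j] if 0 <= j < n else pad)
        encoded.append(vec)
    return encoded


# Hypothetical example sequence; each residue yields one fixed-length vector.
sequence = "MKTAYIAK"
features = window_features([one_hot(r) for r in sequence], w=2)
```

With `w=2` and 20 features per position, every residue in the example is represented by a vector of length (2·2 + 1)·20 = 100, regardless of where it sits in the sequence. Any other per-residue, sequence-derived feature could be substituted for the one-hot vectors without changing `window_features`.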


