Graphlet Kernels for Prediction of Functional Residues in Protein Structures

Vladimir Vacic,Stefano Lonardi,Lilia M Iakoucheva,Predrag Radivojac

doi:10.1089/cmb.2009.0029

Abstract

We introduce a novel graph-based kernel method for annotating functional residues in protein structures. A structure is first modeled as a protein contact graph, where nodes correspond to residues and edges connect spatially neighboring residues. Each vertex in the graph is then represented as a vector of counts of labeled non-isomorphic subgraphs (graphlets), centered on the vertex of interest. A similarity measure between two vertices is expressed as the inner product of their respective count vectors and is used in a supervised learning framework to classify protein residues. We evaluated our method on two function prediction problems: identification of catalytic residues in proteins, which is a well-studied problem suitable for benchmarking, and a much less explored problem of predicting phosphorylation sites in protein structures. The performance of the graphlet kernel approach was then compared against two alternative methods, a sequence-based predictor and our implementation of the FEATURE framework. On both tasks, the graphlet kernel performed favorably; however, the margin of difference was considerably higher on the problem of phosphorylation site prediction. While there is data that phosphorylation sites are preferentially positioned in intrinsically disordered regions, we provide evidence that for the sites that are located in structured regions, neither the surface accessibility alone nor the averaged measures calculated from the residue microenvironments utilized by FEATURE were sufficient to achieve high accuracy. The key benefit of the graphlet representation is its ability to capture neighborhood similarities in protein structures via enumerating the patterns of local connectivity in the corresponding labeled graphs.

Full Text

Published Version

View

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

Graphlet Kernels for Prediction of Functional Residues in Protein Structures

Abstract

Published Version

Talk to us

Similar Papers

More From: Journal of Computational Biology

Lead the way for us

Journal: Journal of Computational Biology	Publication Date: Jan 1, 2010
Citations: 121

Similar Papers

Compositional preferences in quadruplets of nearest neighbor residues in protein structures: statistical geometry analysis
I.I Vaisman ... A Tropsha
-
I.I Vaisman, et. al.I.I Vaisman ... A Tropsha
21 Mar 1998
21 Mar 1998

Relating destabilizing regions to known functional sites in proteins.
Benoît H Dessailly ... Marc F Lensink
BMC Bioinformatics | VOL. 8
Benoît H Dessailly, et. al.Benoît H Dessailly ... Marc F Lensink
30 Apr 2007
BMC Bioinformatics | VOL. 8

Statistical analysis of unstructured amino acid residues in protein structures
M Yu Lobanov ... O V Galzitskaya
Biochemistry (Moscow) | VOL. 75
M Yu Lobanov, et. al.M Yu Lobanov ... O V Galzitskaya
01 Feb 2010
Biochemistry (Moscow) | VOL. 75

Thinking Outside the Informatics Box: Computed Chemical Properties for Protein Function Annotation
Mary Jo Ondrechen ... Penny J Beuning
The FASEB Journal | VOL. 33
Mary Jo Ondrechen, et. al.Mary Jo Ondrechen ... Penny J Beuning
01 Apr 2019
The FASEB Journal | VOL. 33

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

Graphlet Kernels for Prediction of Functional Residues in Protein Structures

Abstract

Published Version

Talk to us

Similar Papers

More From: Journal of Computational Biology