Graph pyramids for protein function prediction.

Tushar Sandhan, Jin Young Choi, Sun Kim, Youngjun Yoo

doi:10.1186/1755-8794-8-s2-s12

Abstract

Background: Uncovering the hidden organizational characteristics and regularities among biological sequences is the key issue for a detailed understanding of an underlying biological phenomenon. Pattern recognition from nucleic acid sequences is therefore important for protein function prediction. As proteins from the same family exhibit similar characteristics, homology-based approaches predict protein functions via protein classification. But conventional classification approaches mostly rely on global features, considering only strong protein similarity matches. This leads to a significant loss of prediction accuracy.

Methods: Here we construct the Protein-Protein Similarity (PPS) network, which captures the subtle properties of protein families. The proposed method considers local as well as global features, by examining the interactions among 'weakly interacting proteins' in the PPS network and by using hierarchical graph analysis via the graph pyramid. Different underlying properties of the protein families are uncovered by operating the proposed graph-based features at various pyramid levels.

Results: Experimental results on benchmark data sets show that the proposed hierarchical voting algorithm using the graph pyramid helps to improve computational efficiency as well as protein classification accuracy. Quantitatively, among 14,086 test sequences, the proposed method misclassified only 21.1 sequences on average, whereas the baseline global feature matching method based on BLAST scores misclassified 362.9 sequences. With each correctly classified test sequence, the fast incremental learning ability of the proposed method further enhances the training model. Thus it has achieved more than 96% protein classification accuracy using only 20% per class training data.
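The misclassification counts quoted in the Results can be converted into error rates with a few lines of arithmetic. The snippet below is only a worked calculation on the three numbers given in the abstract; the variable and function names are illustrative, and the derived percentages are not figures reported by the authors.

```python
# Numbers taken directly from the abstract.
n_test = 14086            # test sequences
proposed_errors = 21.1    # average misclassified, proposed method
baseline_errors = 362.9   # average misclassified, BLAST score baseline


def error_rate(errors, total):
    """Fraction of test sequences that were misclassified."""
    return errors / total


proposed_rate = error_rate(proposed_errors, n_test)
baseline_rate = error_rate(baseline_errors, n_test)
reduction_factor = baseline_errors / proposed_errors

print(f"proposed error rate: {proposed_rate:.2%}")   # 0.15%
print(f"baseline error rate: {baseline_rate:.2%}")   # 2.58%
print(f"baseline / proposed: {reduction_factor:.1f}x")  # 17.2x
```

On these averaged counts, the baseline makes roughly 17 times as many errors as the proposed method.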

Highlights

Uncovering the hidden organizational characteristics and regularities among biological sequences is the key issue for detailed understanding of an underlying biological phenomenon
As discussed initially in the background section, here we took an approach based on protein homology for protein function prediction
We took this approach because it is a fast, approximate first step for tackling the daunting task of predicting the functions of a large number of proteins

Summary

Introduction

Uncovering the hidden organizational characteristics and regularities among biological sequences is the key issue for a detailed understanding of an underlying biological phenomenon. As proteins from the same family exhibit similar characteristics, homology-based approaches predict protein functions via protein classification. Conventional classification approaches mostly rely on global features, considering only strong protein similarity matches. This leads to a significant loss of prediction accuracy. Knowing just the amino-acid sequence and structure of a protein does not guarantee that we can predict everything about that protein. These measures are a good starting point for quickly predicting protein functions with the help of known homology. Searching for only the highest-scoring match in a protein database amounts to looking only at the global feature in the sequence similarity space.
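The contrast drawn above, between trusting only the highest-scoring match and also consulting weaker matches, can be illustrated with a toy sketch. This is not the authors' PPS network or graph pyramid algorithm; it only shows how a top-hit rule and a simple score-weighted vote can disagree on the same hit list. The family labels, the scores, the `min_score` threshold, and the function names `top_hit_label` and `weighted_vote_label` are all hypothetical.

```python
from collections import defaultdict


def top_hit_label(hits):
    """Global-feature baseline: family of the single best-scoring hit."""
    family, _score = max(hits, key=lambda h: h[1])
    return family


def weighted_vote_label(hits, min_score=0.0):
    """Sum scores per family over all hits at or above min_score,
    then return the family with the largest total."""
    totals = defaultdict(float)
    for family, score in hits:
        if score >= min_score:
            totals[family] += score
    return max(totals, key=totals.get)


# Hypothetical (family, similarity score) hits for one query protein.
hits = [
    ("kinase", 95.0),    # one strong match
    ("protease", 60.0),  # several weaker matches to another family
    ("protease", 55.0),
    ("protease", 50.0),
]

print(top_hit_label(hits))        # kinase
print(weighted_vote_label(hits))  # protease
```

The toy data are constructed so that the two rules disagree; which answer is correct for a real protein is exactly what the benchmark experiments have to decide. Both functions assume a non-empty hit list.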



Journal: BMC Medical Genomics
Publication Date: May 29, 2015
Citations: 26
License type: CC BY
