Toward High-Throughput, Multicriteria Protein-Structure Comparison and Analysis

Azhar Ali Shah,Gianluigi Folino,Natalio Krasnogor

doi:10.1109/tnb.2010.2043851

Abstract

Protein-structure comparison (PSC) is an essential component of biomedical research as it impacts on, e.g., drug design, molecular docking, protein folding and structure prediction algorithms as well as being essential to the assessment of these predictions. Each of these applications, as well as many others where molecular comparison plays an important role, requires a different notion of similarity that naturally lead to the multicriteria PSC (MC-PSC) problem. Protein (Structure) Comparison, Knowledge, Similarity, and Information (ProCKSI) (www.procksi.org) provides algorithmic solutions for the MC-PSC problem by means of an enhanced structural comparison that relies on the principled application of information fusion to similarity assessments derived from multiple comparison methods. Current MC-PSC works well for moderately sized datasets and it is time consuming as it provides public service to multiple users. Many of the structural bioinformatics applications mentioned above would benefit from the ability to perform, for a dedicated user, thousands or tens of thousands of comparisons through multiple methods in real time, a capacity beyond our current technology. In this paper, we take a key step into that direction by means of a high-throughput distributed reimplementation of ProCKSI for very large datasets. The core of the proposed framework lies in the design of an innovative distributed algorithm that runs on each compute node in a cluster/grid environment to perform structure comparison of a given subset of input structures using some of the most popular PSC methods [e.g., universal similarity metric (USM), maximum contact map overlap (MaxCMO), fast alignment and search tool (FAST), distance alignment (DaliLite), combinatorial extension (CE), template modeling alignment (TMAlign)]. We follow this with a procedure of distributed consensus building. Thus, the new algorithms proposed here achieve ProCKSI's similarity assessment quality but with a fraction of the time required by it. Our results show that the proposed distributed method can be used efficiently to compare: 1) a particular protein against a very large protein structures dataset (target-against-all comparison), and 2) a particular very large-scale dataset against itself or against another very large-scale dataset (all-against-all comparison). We conclude the paper by enumerating some of the outstanding challenges for real-time MC-PSC.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Toward High-Throughput, Multicriteria Protein-Structure Comparison and Analysis

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on NanoBioscience

Lead the way for us

Journal: IEEE Transactions on NanoBioscience	Publication Date: Jun 1, 2010
Citations: 47

Similar Papers

ProCKSI: a decision support system for Protein (structure) Comparison, Knowledge, Similarity and Information.
Daniel Barthel ... Jonathan D Hirst
BMC bioinformatics | VOL. 8
Daniel Barthel, et. al.Daniel Barthel ... Jonathan D Hirst
26 Oct 2007
BMC bioinformatics | VOL. 8

A fuzzy sets based generalization of contact maps for the overlap of protein structures
David Pelta ... Edmund Burke
Fuzzy Sets and Systems | VOL. 152
David Pelta, et. al.David Pelta ... Edmund Burke
19 Nov 2004
Fuzzy Sets and Systems | VOL. 152

An aggregate analysis of many predicted structures to reduce errors in protein structure comparison caused by conformational flexibility
Brian G Godshall ... Wenjie Yang
BMC Structural Biology | VOL. 13
Brian G Godshall, et. al.Brian G Godshall ... Wenjie Yang
01 Nov 2013
BMC Structural Biology | VOL. 13

LOPAL and SCAMP: techniques for the comparison and display of protein structures
Geoffrey J Barton ... Michael J.E Sternberg
Journal of Molecular Graphics | VOL. 6
Geoffrey J Barton, et. al.Geoffrey J Barton ... Michael J.E Sternberg
01 Dec 1988
Journal of Molecular Graphics | VOL. 6

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Toward High-Throughput, Multicriteria Protein-Structure Comparison and Analysis

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on NanoBioscience