Abstract

BackgroundStructure comparison can provide useful information to identify functional and evolutionary relationship between proteins. With the dramatic increase of protein structure data in the Protein Data Bank, computation time quickly becomes the bottleneck for large scale structure comparisons. To more efficiently deal with informative multiple structure alignment tasks, we propose pmTM-align, a parallel protein structure alignment approach based on mTM-align/TM-align. pmTM-align contains two stages to handle pairwise structure alignments with Spark and the phylogenetic tree-based multiple structure alignment task on a single computer with OpenMP.ResultsExperiments with the SABmark dataset showed that parallelization along with data structure optimization provided considerable speedup for mTM-align. The Spark-based structure alignments achieved near ideal scalability with large datasets, and the OpenMP-based construction of the phylogenetic tree accelerated the incremental alignment of multiple structures and metrics computation by a factor of about 2–5.ConclusionspmTM-align enables scalable pairwise and multiple structure alignment computing and offers more timely responses for medium to large-sized input data than existing alignment tools such as mTM-align.

Highlights

  • Structure comparison can provide useful information to identify functional and evolutionary relationship between proteins

  • Many pairwise structure alignment (PSA) methods are developed in this aim, like DALI [2], CE [3], TM-align [4], etc

  • Since pmTM-align contains two stages that are run with different tools, i.e. Spark and OpenMP, we designed the experiment in a two-stage manner

Read more

Summary

Introduction

Structure comparison can provide useful information to identify functional and evolutionary relationship between proteins. To more efficiently deal with informative multiple structure alignment tasks, we propose pmTM-align, a parallel protein structure alignment approach based on mTM-align/TM-align. PmTM-align contains two stages to handle pairwise structure alignments with Spark and the phylogenetic tree-based multiple structure alignment task on a single computer with OpenMP. Structure comparison can be used to identify functional and evolutionary relationship between proteins, which are very useful for functional annotation, structure-based drug design, protein–protein docking, and many other applications [1]. To compare two protein structures, we first need to find the best structural alignment between two proteins to initiate residue-level comparison. Chen et al BMC Bioinformatics (2020) 21:426 is noted as multiple structure alignment (MSTA). As a natural extension of PSA, MSTA can be built with progressive merging of PSA results [5], or based on iterative fragment alignment and assembly [6]

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call