Scaling Ab Initio Predictions of 3D Protein Structures in Microsoft Azure Cloud

Dariusz Mrozek,Bożena Małysiak-Mrozek,Paweł Gosk

doi:10.1007/s10723-015-9353-8

Dariusz Mrozek, Bożena Małysiak-Mrozek + Show 1 more

Open Access

https://doi.org/10.1007/s10723-015-9353-8

Copy DOI

Journal: Journal of Grid Computing	Publication Date: Nov 12, 2015
Citations: 33	License type: CC BY 4.0

Affiliation: Silesian University of Technology

Abstract

Computational methods for protein structure prediction allow us to determine a three-dimensional structure of a protein based on its pure amino acid sequence. These methods are a very important alternative to costly and slow experimental methods, like X-ray crystallography or Nuclear Magnetic Resonance. However, conventional calculations of protein structure are time-consuming and require ample computational resources, especially when carried out with the use of ab initio methods that rely on physical forces and interactions between atoms in a protein. Fortunately, at the present stage of the development of computer science, such huge computational resources are available from public cloud providers on a pay-as-you-go basis. We have designed and developed a scalable and extensible system, called Cloud4PSP, which enables predictions of 3D protein structures in the Microsoft Azure commercial cloud. The system makes use of the Warecki-Znamirowski method as a sample procedure for protein structure prediction, and this prediction method was used to test the scalability of the system. The results of the efficiency tests performed proved good acceleration of predictions when scaling the system vertically and horizontally. In the paper, we show the system architecture that allowed us to achieve such good results, the Cloud4PSP processing model, and the results of the scalability tests. At the end of the paper, we try to answer which of the scaling techniques, scaling out or scaling up, is better for solving such computational problems with the use of Cloud computing.

Highlights

Protein structure prediction is one of the most important and yet difficult processes for modern computational biology and structural bioinformatics [21]
The practical role of protein structure prediction is becoming even more important in the face of the dynamically growing number of protein sequences obtained through the translation of DNA sequences coming from large-scale sequencing projects
Computational procedures that allow for the determination of a protein structure from its amino acid sequence are great alternatives for experimental methods for protein structure determination, like X-ray crystallography and Nuclear Magnetic Resonance (NMR)

Summary

Introduction

Protein structure prediction is one of the most important and yet difficult processes for modern computational biology and structural bioinformatics [21]. Experimental methods for the determination of protein structures, such as X-ray crystallography or Nuclear Magnetic Resonance (NMR), are lagging behind the number of protein sequences. Computational procedures that allow for the determination of a protein structure from its amino acid sequence are great alternatives for experimental methods for protein structure determination, like X-ray crystallography and NMR. Protein structure prediction refers to the computational procedure that delivers a three-dimensional structure of a protein based on its amino acid sequence (Fig. 1). There are various approaches to the problem and many algorithms have been developed These methods generally fall into two groups: (1) physical and (2) comparative [35, 72]. Most of them try to reproduce nature’s algorithm and implement it as a computational procedure in order to give proteins their unique 3D native conformations [43].

Methods

Results

Discussion

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Scaling Ab Initio Predictions of 3D Protein Structures in Microsoft Azure Cloud

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Journal of Grid Computing

Lead the way for us

Similar Papers

An Improved Integration of Template-Based and Template-Free Protein Structure Modeling Methods and its Assessment in CASP11
Jilong Li ... Jianlin Cheng
Protein & Peptide Letters | VOL. 22
Jilong Li, et. al.Jilong Li ... Jianlin Cheng
01 Jan 2015
Protein & Peptide Letters | VOL. 22

Exploring the conformational space for protein folding with sequential Monte Carlo
Samuel W K Wong ... S C Kou
The Annals of Applied Statistics | VOL. 12
Samuel W K Wong, et. al.Samuel W K Wong ... S C Kou
01 Sep 2018
The Annals of Applied Statistics | VOL. 12

Genetic algorithm feature-based resampling for protein structure prediction
Trent Higgs ... Bela Stantic
-
Trent Higgs, et. al.Trent Higgs ... Bela Stantic
01 Jul 2010
01 Jul 2010

From Interatomic Distances to Protein Tertiary Structures with a Deep Convolutional Neural Network
Yuanqi Du ... Amarda Shehu
-
Yuanqi Du, et. al.Yuanqi Du ... Amarda Shehu
21 Sep 2020
21 Sep 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Scaling Ab Initio Predictions of 3D Protein Structures in Microsoft Azure Cloud

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Journal of Grid Computing