Dimensioning the virtual cluster for parallel scientific workflows in clouds

Daniel De Oliveira,Kary Ocana,Marta Mattoso,Vitor Viana,Eduardo Ogasawara

doi:10.1145/2465848.2465852

Abstract

Cloud computing has established itself as a solid computational model that allows for scientists to use a series of distributed virtual resources to execute a wide range of scientific experiments. In several cases, there is a demand for high performance in executing these experiments since many activities are data and computing intensive. Parallelism techniques are a key issue in this experimentation process. There are approaches that provide parallelism capabilities for scientific workflows in clouds. However, most of them rely on the scientist to dimension the virtual cluster to be instantiated. Dimensioning the virtual cluster to execute the workflow in parallel may be a hard task to accomplish, i.e. it is hard to define and adapt the optimal number of virtual machines to be used. Most systems follow this manual configuration of the scientist for the whole workflow execution, using adaptive techniques only in the presence of failures. Due to the huge number of options (virtual machine types) to configure a cloud environment, the configuration task commonly becomes impractical to be performed manually, and if it is not adjusted adaptively during the execution, it can impact negatively on workflow performance, or it can produce excessive increase in financial cost. This paper proposes a service called SciDim which is based on the use of a multi-objective cost function allied to genetic algorithms and provenance data to help determining an ideal initial configuration for the virtual cluster, under budget and deadline constraints set by the scientist

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Dimensioning the virtual cluster for parallel scientific workflows in clouds

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Optimizing virtual machine allocation for parallel scientific workflows in federated clouds
Rafaelli De C Coutinho ... Daniel De Oliveira
Future Generation Computer Systems | VOL. 46
Rafaelli De C Coutinho, et. al.Rafaelli De C Coutinho ... Daniel De Oliveira
22 Oct 2014
Future Generation Computer Systems | VOL. 46

A Provenance-based Adaptive Scheduling Heuristic for Parallel Scientific Workflows in Clouds
Daniel De Oliveira ... Kary A C S Ocaña
Journal of Grid Computing | VOL. 10
Daniel De Oliveira, et. al.Daniel De Oliveira ... Kary A C S Ocaña
25 Aug 2012
Journal of Grid Computing | VOL. 10

Evaluating Grasp-based cloud dimensioning for comparative genomics: A practical approach
Rafaelli Coutinho ... Daniel De Oliveira
-
Rafaelli Coutinho, et. al.Rafaelli Coutinho ... Daniel De Oliveira
01 Sep 2014
01 Sep 2014

A Dynamic Cloud Dimensioning Approach for Parallel Scientific Workflows: a Case Study in the Comparative Genomics Domain
Rafaelli Coutinho ... Kary Ocaña
Journal of Grid Computing | VOL. 14
Rafaelli Coutinho, et. al.Rafaelli Coutinho ... Kary Ocaña
02 Jun 2016
Journal of Grid Computing | VOL. 14

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Dimensioning the virtual cluster for parallel scientific workflows in clouds

Abstract

Talk to us

Similar Papers