DDS: A deadlock detection-based scheduling algorithm for workflow computations in HPC systems with storage constraints

Yang Wang,Paul Lu

doi:10.1016/j.parco.2013.04.006

Abstract

Workflow-based workloads usually consist of multiple instances of the same workflow, which are jobs with control or data dependencies, to carry out a well-defined scientific computation task, with each instance acting on its own input data. To maximize throughput performance, a high degree of concurrency is achievable by running multiple instances simultaneously. However, deadlock is a potential problem when storage is constrained. To address this problem, we design and evaluate a deadlock detection-based scheduling (DDS) algorithm that can achieve high performance by making the best use of the available storage resources. Our algorithm takes advantages of the dataflow information of the workflow to speculatively schedule each instance if the instant storage is sufficient for some constituent jobs, but not necessarily for the whole workflow instance. Whenever deadlock or a performance anomaly is detected, some selected in-progress workflow instances are required to be rollbacked to release storage for other blocked jobs. We develop a suite of strategies to select the victims and beneficiaries (instances or jobs) and evaluate their performance via a simulation-based study. Our results show that the DDS algorithm can adapt the job concurrency to the available storage resources and achieve higher performance than some deadlock avoidance methods in our synthetic and real workflow computations.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

DDS: A deadlock detection-based scheduling algorithm for workflow computations in HPC systems with storage constraints

Abstract

Talk to us

Similar Papers

More From: Parallel Computing

Lead the way for us

Journal: Parallel Computing	Publication Date: May 4, 2013
Citations: 13

Similar Papers

Maximizing Active Storage Resources with Deadlock Avoidance in Workflow-Based Computations
Yang Wang ... Paul Lu
IEEE Transactions on Computers | VOL. 62
Yang Wang, et. al.Yang Wang ... Paul Lu
01 Nov 2013
IEEE Transactions on Computers | VOL. 62

Dataflow-Based Scheduling for Scientific Workflows in HPC with Storage Constraints
... Wei Shi
The Computer Journal | VOL. 58
, et. al. ... Wei Shi
17 Oct 2014
The Computer Journal | VOL. 58

Scheduling parameter sweep workflow in the grid

-

17 Feb 2017
17 Feb 2017

Utilizing Heterogeneous Data Sources in Computational Grid Workflows
Tamas Kiss ... Peter Kacsuk
-
Tamas Kiss, et. al.Tamas Kiss ... Peter Kacsuk
01 Jan 2008
01 Jan 2008

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

DDS: A deadlock detection-based scheduling algorithm for workflow computations in HPC systems with storage constraints

Abstract

Talk to us

Similar Papers

More From: Parallel Computing