A clustering based coscheduling strategy for efficient scientific workflow execution in cloud computing

Kefeng Deng, Jinjun Chen, Junqiang Song, Yang Xiang, Dong Yuan, Kaijun Ren

doi:10.1002/cpe.3084

Abstract

Due to its cost-effectiveness, on-demand provisioning, and ease of sharing, cloud computing has grown in popularity with the research community for deploying scientific applications such as workflows. As this interest continues to grow and scientific workflows are widely deployed in collaborative cloud environments that consist of a number of data centers, there is an urgent need for strategies that place application datasets across globally distributed data centers and schedule tasks according to the data layout, so as to reduce both latency and makespan for workflow execution. In this paper, by utilizing dependencies among datasets and tasks, we propose an efficient data and task coscheduling strategy that places input datasets in a load-balanced way while grouping the most closely related datasets and tasks together. Moreover, data staging is used to overlap task execution with data transmission so that tasks can start earlier. We build a simulation environment on the Tianhe supercomputer to evaluate the proposed strategy and run simulations with random and realistic workflows. The results demonstrate that the proposed strategy can effectively improve scheduling performance while reducing the total volume of data transfer across data centers. Concurrency and Computation: Practice and Experience, 2013. © 2013 Wiley Periodicals, Inc.
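The idea of placing datasets in a load-balanced way while keeping related datasets and tasks together can be illustrated with a small sketch. This is not the authors' clustering algorithm, which the abstract does not specify; it is a simple greedy heuristic written under stated assumptions. The function names (`place_datasets`, `schedule_tasks`, `cross_center_transfer`), the per-center `capacity` limit, and the use of "number of tasks reading both datasets" as the dependency weight are all hypothetical choices made for illustration.

```python
from collections import defaultdict


def place_datasets(sizes, tasks, num_centers, capacity):
    """Greedy illustrative placement (an assumption, not the paper's method).

    sizes:  {dataset: size}
    tasks:  {task: [input datasets]}
    Returns {dataset: data center index}.
    """
    # Dependency weight between two datasets: number of tasks that read both.
    affinity = defaultdict(int)
    for inputs in tasks.values():
        for a in inputs:
            for b in inputs:
                if a != b:
                    affinity[(a, b)] += 1

    load = [0] * num_centers
    placement = {}
    # Place the largest datasets first; break ties by name for determinism.
    for d in sorted(sizes, key=lambda name: (-sizes[name], name)):
        best, best_key = None, None
        for c in range(num_centers):
            if load[c] + sizes[d] > capacity:
                continue  # respect the assumed storage limit of the center
            score = sum(affinity.get((d, other), 0)
                        for other, oc in placement.items() if oc == c)
            key = (score, -load[c])  # prefer high affinity, then light load
            if best_key is None or key > best_key:
                best, best_key = c, key
        if best is None:
            raise ValueError(f"no data center can hold dataset {d!r}")
        placement[d] = best
        load[best] += sizes[d]
    return placement


def schedule_tasks(tasks, sizes, placement):
    """Send each task to the center that already holds most of its input."""
    schedule = {}
    for t, inputs in tasks.items():
        local = defaultdict(int)
        for d in inputs:
            local[placement[d]] += sizes[d]
        schedule[t] = max(sorted(local), key=lambda c: local[c])
    return schedule


def cross_center_transfer(tasks, sizes, placement, schedule):
    """Total volume of input data that must move between data centers."""
    return sum(sizes[d]
               for t, inputs in tasks.items()
               for d in inputs
               if placement[d] != schedule[t])


sizes = {"d1": 4, "d2": 4, "d3": 4, "d4": 4}
tasks = {"t1": ["d1", "d2"], "t2": ["d3", "d4"]}
placement = place_datasets(sizes, tasks, num_centers=2, capacity=8)
schedule = schedule_tasks(tasks, sizes, placement)
transfer = cross_center_transfer(tasks, sizes, placement, schedule)
```

In this toy workflow the two datasets read by each task end up in the same data center and each task runs where its inputs are stored, so no input data crosses data centers. Data staging, which the abstract uses to overlap transfers with execution, is not modeled here.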


Journal: Concurrency and Computation: Practice and Experience
Publication Date: Jul 4, 2013
Citations: 23

