Approximation algorithms and heuristics for task scheduling in data‐intensive distributed systems

Marcelo G Póvoa,Eduardo C Xavier

doi:10.1111/itor.12527

Abstract

AbstractIn this work, we are interested in the problem of task scheduling on large‐scale data‐intensive computing systems. In order to achieve good performance, one must construct not only good task schedules but also good data allocation across nodes on the system, since before a task can be executed, it must have access to data distributed on the system. In this article, we present a general formulation of a static problem that combines both scheduling and replication problems in data‐intensive distributed systems. We show that this problem does not admit an approximation algorithm. However, considering a restricted version of the problem that considers some practical constraints, an approximation algorithm can be designed. From a practical perspective, we introduce a novel heuristic for the problem that is based on nodes clustering. We compare the heuristic with two adapted approaches from other works in the literature by computational simulations using an extensive set of instances based on real computer grids. We show that our heuristic often obtains the best solutions and also runs faster than other approaches.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Approximation algorithms and heuristics for task scheduling in data‐intensive distributed systems

Abstract

Talk to us

Similar Papers

More From: International Transactions in Operational Research

Lead the way for us

Journal: International Transactions in Operational Research	Publication Date: Mar 23, 2018
Citations: 1

Similar Papers

HIGA: Harmony-inspired genetic algorithm for rack-aware energy-efficient task scheduling in cloud data centers
Mohan Sharma ... Ritu Garg
Engineering Science and Technology, an International Journal | VOL. 23
Mohan Sharma, et. al.Mohan Sharma ... Ritu Garg
19 Apr 2019
Engineering Science and Technology, an International Journal | VOL. 23

Task Scheduling Techniques for Energy Efficiency in the Cloud
Sanna Mehraj Kak ... M Afshar Alam
EAI Endorsed Transactions on Energy Web | VOL. 9
Sanna Mehraj Kak, et. al.Sanna Mehraj Kak ... M Afshar Alam
20 Jun 2022
EAI Endorsed Transactions on Energy Web | VOL. 9

MOGATS: a multi-objective genetic algorithm-based task scheduling for heterogeneous embedded systems
Mohsen Raji ... Mohaddaseh Nikseresht
International Journal of Embedded Systems | VOL. 14
Mohsen Raji, et. al.Mohsen Raji ... Mohaddaseh Nikseresht
01 Jan 2020
International Journal of Embedded Systems | VOL. 14

Bounds on Multiprocessing Timing Anomalies
R L Graham
SIAM Journal on Applied Mathematics | VOL. 17
R L GrahamR L Graham
01 Mar 1969
SIAM Journal on Applied Mathematics | VOL. 17

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Approximation algorithms and heuristics for task scheduling in data‐intensive distributed systems

Abstract

Talk to us

Similar Papers

More From: International Transactions in Operational Research