Abstract

In cloud computing, a large-scale parallel-distributed processing service is provided in which a huge task is split into a number of subtasks and those subtasks are processed on a cluster of machines called workers. In such a processing service, a worker that takes a long time to process a subtask prolongs the response time (the straggler problem). One efficient method to alleviate this issue is to execute the same subtask on another worker as a backup for the slow worker (backup tasks). In this paper, we consider the efficiency of backup tasks. We model the task-scheduling server as a single-server queue in which the server consists of a number of workers. When a task enters the server, the task is split into subtasks, and each subtask is served by its own worker and by an alternative distinct worker. For this processing, we explicitly derive the task processing time distributions for the two cases in which the subtask processing time of a worker follows a Weibull or a Pareto distribution. We compare the mean response time and the total processing time under backup-task scheduling with those under normal scheduling. Numerical examples show that the efficiency of backup-task scheduling significantly depends on the workers' processing time distribution.
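As a rough illustration of how such distributions arise (our own sketch and notation, assuming a task split into n equally sized subtasks with i.i.d. subtask processing times; the paper's exact expressions may differ):

```latex
% Sketch under our own notation: n subtasks, per-worker subtask processing time CDF F.
% Normal scheduling: the task completes when the slowest subtask completes.
P(T_{\mathrm{normal}} \le x) = F(x)^{n}
% Backup-task scheduling: each subtask completes when the faster of its two
% workers completes, so its completion time has CDF 1 - (1 - F(x))^2.
P(T_{\mathrm{backup}} \le x) = \bigl[\, 1 - (1 - F(x))^{2} \,\bigr]^{n}
% Per-worker distributions considered in the paper:
F_{\mathrm{Weibull}}(x) = 1 - e^{-(x/\eta)^{\beta}}, \qquad
F_{\mathrm{Pareto}}(x)  = 1 - \left(\tfrac{x_{m}}{x}\right)^{\alpha} \quad (x \ge x_{m})
```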

Highlights

  • Cloud computing has attracted considerable attention due to the availability of huge computing resources and its significant cost efficiency. In [1], cloud computing is defined as the sum of the existing concepts, software as a service (SaaS) and utility computing.

  • β = 4.702, β = 3.159, β = 2.414, β = 2.095, β = 2.007. These results suggest that the coefficient of variation of the worker processing time distribution significantly affects the response time performance of large-scale parallel-distributed processing (see the sketch after this list).
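If, as the context suggests, the listed β values are Weibull shape parameters, the link to the coefficient of variation (CV) can be checked directly, since for a Weibull distribution the CV depends only on the shape parameter. A minimal sketch (our own check, not taken from the paper):

```python
# Our own check, not from the paper: for a Weibull distribution with shape
# parameter beta, the coefficient of variation (CV) depends only on beta:
#   CV(beta) = sqrt( Gamma(1 + 2/beta) / Gamma(1 + 1/beta)^2 - 1 ),
# so a smaller beta means a more variable worker processing time.
from math import gamma, sqrt

def weibull_cv(beta: float) -> float:
    m1 = gamma(1.0 + 1.0 / beta)   # mean, per unit scale
    m2 = gamma(1.0 + 2.0 / beta)   # second moment, per unit scale squared
    return sqrt(m2 / (m1 * m1) - 1.0)

for beta in (4.702, 3.159, 2.414, 2.095, 2.007):
    print(f"beta = {beta:.3f}  ->  CV = {weibull_cv(beta):.3f}")
```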

Summary

Introduction

Cloud computing has attracted considerable attention due to the availability of huge computing resources and its significant cost efficiency. Previous work that focuses on the percentile of the response time as a performance measure of cloud computing approximately analyzes the response time distribution; in that model, the service-center part is modeled as a single-server queue with a fixed service rate, which is too simple to describe the large-scale parallel-distributed processing of a task. We consider the efficiency of backup-task scheduling with respect to two performance measures: the response time of a task and the total processing time of the workers. Under normal scheduling, each subtask is processed only by its own worker. Under backup-task scheduling, on the other hand, each subtask is processed by its own worker and by an alternative distinct worker, and the subtask service ends when either of the two workers completes its processing. For both scheduling policies, we explicitly derive the task processing time distributions when the subtask processing time of a worker follows a Weibull or a Pareto distribution. The assumption of equally sized subtasks becomes accurate when a huge input data set is split into data pieces of almost equal size [4].
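A minimal Monte Carlo sketch of this comparison (our own illustration, not the paper's analytical derivation), assuming a single task of i.i.d. Weibull subtasks and that under backup-task scheduling both workers of a subtask run until the faster one completes; the parameter names (n_subtasks, shape, scale) and this cost accounting are our assumptions:

```python
# Monte Carlo sketch: compare normal vs. backup-task scheduling for one task
# split into n_subtasks equally sized subtasks with i.i.d. Weibull processing
# times. Assumption (ours): under backup-task scheduling both workers of a
# subtask are busy until the faster of the two finishes.
import numpy as np

rng = np.random.default_rng(seed=1)

def compare(n_subtasks=100, shape=2.0, scale=1.0, runs=10_000):
    # Per-run, per-subtask processing times of the primary and backup workers.
    primary = scale * rng.weibull(shape, size=(runs, n_subtasks))
    backup = scale * rng.weibull(shape, size=(runs, n_subtasks))

    # Normal scheduling: the task ends when its slowest subtask ends;
    # the total processing time is the sum of all subtask times.
    rt_normal = primary.max(axis=1)
    tp_normal = primary.sum(axis=1)

    # Backup-task scheduling: each subtask ends at the minimum of its two
    # workers' times; both workers are busy until that moment.
    finished = np.minimum(primary, backup)
    rt_backup = finished.max(axis=1)
    tp_backup = 2.0 * finished.sum(axis=1)

    return rt_backup.mean() / rt_normal.mean(), tp_backup.mean() / tp_normal.mean()

if __name__ == "__main__":
    for shape in (1.0, 2.0, 4.0):  # larger shape -> smaller coefficient of variation
        rt_ratio, tp_ratio = compare(shape=shape)
        print(f"shape={shape:.1f}: response-time ratio {rt_ratio:.3f}, "
              f"total-processing-time ratio {tp_ratio:.3f}")
```

Sweeping the Weibull shape parameter in such a sketch is one simple way to see how the variability of the workers' processing times drives the relative benefit of backup tasks.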

Independent identical distribution
The tail of the distribution
The number of workers
The ratio of the mean response time
The ratio of the total processing time
Conclusion