Abstract

Desktop Grid is different from Grid in terms of the characteristics of resources as well as types of sharing. Particularly, resource providers in Desktop Grid are volatile, heterogeneous, faulty, and malicious. These distinct features make it difficult for a scheduler to allocate tasks. Moreover, they deteriorate reliability of computation and performance. Availability metrics can forecast unavailability & can provide schedulers with information about reliability which helps them to make better scheduling decision when combined them with information about speed. This paper using these metrics for deciding when to replicate jobs & how much to replicate. In particular our metrics forecast the probability that a job will complete uninterrupted & our schedulers replicate those jobs that are least likely to do so. Our policy outperforms other replication policies as measured by improved Total CPU Time & reduced Waiting Time & Failure count

Highlights

  • Grid computing technology provides resource sharing and resource virtualization to end-users, allowing for computational resources to be accessed as a utility

  • The inherent wide distribution, heterogeneity, and dynamism of Grid environments makes them better suited to the execution of loosely-coupled parallel applications, such as Bag-of-Tasks [2] (BoT) applications, rather than of tightlycoupled ones

  • Bag-of-Tasks applications are able to exploit the computing power provided by Grids [3] and, despite their simplicity, are used in a variety of domains, such as parameter sweep, simulations, fractal calculations, computational biology, and computer imaging

Read more

Summary

INTRODUCTION

Grid computing technology provides resource sharing and resource virtualization to end-users, allowing for computational resources to be accessed as a utility. Scheduling algorithms tailored to this class of applications have recently received the attention of the Grid community [3, 4, 5] These algorithms enable BoT applications to achieve very good performance, they suffer from a common drawback, namely their reliance on the assumption that the resources in a Grid are perfectly reliable, i.e. that they will never fail or become unavailable during the execution of a task. This paper aims at filling this gap by proposing a novel fault-tolerant scheduler with dynamic replication for BoT applications, in which resources will be selected on the basis of computation and memory power and on the basis of resource reliability.

RELATED WORK
EXISTING SCHEDULERS
The Standard WQR Scheduler
The WorkQueue with Replication – Fault Tolerant Scheduler
11: WQR-FT algorithm
PROPOSED SCHEDULER
The WorkQueue with Dynamic Replication – Fault Tolerant Scheduler
CONCLUSION AND FUTURE WORK
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.