ABSTRACTGrid computing is a prominent tool for assembling and incorporating groups of heterogeneous resources scattered around the world, connected through a network. Since load balancing is a challenging issue in a distributed environment, Grid computing provides an architecture, which is well suited, low-cost, and consistent. For task execution on suitable resources, grid computing provides efficient load balancing and fault tolerance approaches. But, due to the dynamic nature of grid resources, sometimes tasks can’t be completed within given constraints (deadline, cost). To solve these issues, this paper proposes a fault tolerance–based load balancing approach by considering the dynamic nature of resources. First, significant contributions in the field of load balancing are analyzed based on several parameters. Second, a fault tolerance dynamic load balancing model is proposed for task execution based on resource load and fault index value. For fault tolerance, checkpoints are set at various determined intervals to resume tasks at the next possible instance that avoids unnecessary placement of checkpoints. Third, the proposed model is validated and provides improved performance in terms of response time, makespan, and throughput. The model provides up to 12% and 9% performance improvement depending on method of measurement in comparison to other state-of-the-art methods.
Read full abstract