Abstract

Generally, users working on a Grid infrastructure have a keen interest in knowing (tight) upper bounds for execution times of useful Grid services and their own submitted jobs. However, the execution environments and times in a Grid infrastructure may widely vary in practice, because of several and diverse factors, including: the unpredictable delays on network links, peaks of service requests, the relatively unpredictable outcomes of resource negotiation, changes in services and resources availability (due to partial failure, etc.).It therefore appears that, for successful Grid operation, a crucial ability is that to analyse, and accordingly tune, the amount of resources needed to smoothly execute requests presented to Grid services by user-submitted jobs. For such a tuning to be possible and effective, a suitable characterisation should be available of the times spent executing the services provided by the Grid middleware.This paper focuses on those Grid services, handling job submission, whose execution times can vary substantially, analysing what factors determine or affect this variability. We present and analyse a set of measures that have been carried on a real infrastructure, as usage conditions vary. This knowledge can be very handy to detect the occurrence of anomalous states in Grid operation, and timely start appropriate recovery actions.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call