Online Scheduling with Redirection for Parallel Jobs

Adrien Faure,Denis Trystram,Giorgio Lucarelli,Olivier Richard

doi:10.1109/ipdpsw50202.2020.00066

Online Scheduling with Redirection for Parallel Jobs

Adrien Faure, Denis Trystram + Show 2 more

Open Access

https://doi.org/10.1109/ipdpsw50202.2020.00066

Copy DOI

Publication Date: May 1, 2020

Affiliation: Atos (France), French National Centre for Scientific Research, Grenoble Institute of Technology, French Institute for Research in Computer Science and Automation, Grenoble Computer Science Laboratory, Université Grenoble Alpes, Université de Lorraine

#Component Of High Performance Computing #Job Scheduling Algorithm + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

An important component of High Performance Computing (HPC) clusters is the job scheduling algorithm, which decides the allocation and the scheduling of the jobs in the system. Such scheduling algorithms need to be scalable to confront the growth both in size and in complexity of the modern clusters. We propose in this paper a new algorithm for scheduling parallel jobs with redirection. Specifically, our algorithm redirects the jobs whose execution affects significantly an important number of other jobs. A redirected job is stopped and restarted from the beginning in a dedicated part of the cluster. We show the effectiveness of our method through an intensive experimental campaign of simulations of production cluster log traces.

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Similar Papers

Paper Title

Journal

Date

Author

View more papers

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.