Abstract
SummarySlurm resource management system is used on many TOP500 supercomputers. We present a new auction‐based heterogeneous cluster scheduler plug‐in called AUCSCHED2. AUCSCHED2 contributes two major enhancements: the first is the extension of Slurm to support generic resource moldability by specification of resource ranges. The generic resources include accelerators like graphics processing unit or Xeon Phi. The current version of Slurm supports specification of node ranges but not of generic resource ranges. Such a feature can be very useful to run‐time auto‐tuning applications and systems that can make use of variable number of generic resources. The second enhancement involves the implementation of a new integer programming formulation in AUCSCHED2 that drastically reduces the number of variables. This allows faster solution and larger number of bids to be generated. Slurm emulation results are presented for the heterogeneous 1408 node Tsubame supercomputer, which has 12 cores and three graphics processing units on each of its nodes. AUCSCHED2 is available at https://github.com/aucsched/aucsched2. Copyright © 2015 John Wiley & Sons, Ltd.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
More From: Concurrency and Computation: Practice and Experience
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.