Abstract

The Slurm resource management system is used on many TOP500 supercomputers. We present a new auction‐based heterogeneous cluster scheduler plug‐in called AUCSCHED2. AUCSCHED2 contributes two major enhancements. The first is the extension of Slurm to support generic resource moldability through the specification of resource ranges. Generic resources include accelerators such as graphics processing units or Xeon Phi coprocessors. The current version of Slurm supports the specification of node ranges but not of generic resource ranges. Such a feature can be very useful to run‐time auto‐tuning applications and systems that can make use of a variable number of generic resources. The second enhancement is a new integer programming formulation in AUCSCHED2 that drastically reduces the number of variables. This allows problems to be solved faster and a larger number of bids to be generated. Slurm emulation results are presented for the heterogeneous 1408‐node Tsubame supercomputer, which has 12 cores and three graphics processing units on each of its nodes. AUCSCHED2 is available at https://github.com/aucsched/aucsched2. Copyright © 2015 John Wiley & Sons, Ltd.

