Abstract

Research and development of deep learning (DL) applications often involves exhaustive trial-and-error, which demands that shared computational resources, especially GPUs, be allocated efficiently. Most DL tasks are moldable or malleable (i.e., the number of allocated GPUs can be changed before or during execution). However, conventional batch schedulers do not take advantage of DL tasks' moldability/malleability, inhibiting speedup when some GPU resources are unallocated. Another opportunity for speedup is to run multiple tasks concurrently on one GPU, which may improve overall throughput because a single task does not always fully utilize a GPU's computational resources. We propose designing a batch scheduling system that exploits these opportunities to accelerate DL tasks. As a first step, we conduct an extensive case study to evaluate the speedup of DL tasks when a scheduler treats them as moldable or malleable. That is, the scheduler adjusts the number of GPUs to be (or already) allocated to a task in response to the fluctuating availability of GPUs. Simulations using our real workload trace show that if the scheduler can allocate 1–4 GPUs to a task or assign 1–4 tasks to a GPU, then the average flow time of moldable/malleable DL tasks is shortened by at least 15.1%/42.5%, respectively, compared to a Rigid FCFS schedule in which one GPU is allocated to each task.
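To make the moldable-scheduling idea concrete, the following is a minimal sketch (not the authors' simulator) of an FCFS scheduler that chooses the number of GPUs (1–4) for each task at start time based on current availability and reports the average flow time. The cluster size, speedup model, and workload below are illustrative assumptions, not values from the paper.

```python
# Minimal sketch of a moldable FCFS scheduling simulation.
# Assumptions (not from the paper): 8-GPU cluster, up to 4 GPUs per task,
# a fixed sub-linear speedup model, and a synthetic workload.
import heapq
from dataclasses import dataclass

TOTAL_GPUS = 8
MAX_GPUS_PER_TASK = 4

@dataclass
class Task:
    arrival: float
    work: float  # processing time on a single GPU

def speedup(gpus: int) -> float:
    # Assumed scaling factors; real DL tasks scale differently.
    return {1: 1.0, 2: 1.8, 3: 2.5, 4: 3.1}[gpus]

def simulate_moldable_fcfs(tasks):
    """Return the average flow time (completion - arrival) over all tasks."""
    tasks = sorted(tasks, key=lambda t: t.arrival)
    free = TOTAL_GPUS
    now = 0.0
    running = []  # min-heap of (finish_time, gpus_held)
    flow_times = []
    for t in tasks:
        now = max(now, t.arrival)
        # Release GPUs from tasks that have already finished.
        while running and running[0][0] <= now:
            _, g = heapq.heappop(running)
            free += g
        # If no GPU is free, wait for the next completion.
        while free == 0:
            finish, g = heapq.heappop(running)
            now = finish
            free += g
        # Moldable decision: take as many free GPUs as allowed.
        g = min(free, MAX_GPUS_PER_TASK)
        free -= g
        finish = now + t.work / speedup(g)
        heapq.heappush(running, (finish, g))
        flow_times.append(finish - t.arrival)
    return sum(flow_times) / len(flow_times)

if __name__ == "__main__":
    workload = [Task(arrival=i * 2.0, work=10.0) for i in range(10)]
    print("avg flow time:", simulate_moldable_fcfs(workload))
```

The rigid baseline described in the abstract corresponds to the same loop with the GPU count fixed at one per task; a malleable scheduler would additionally resize tasks that are already running when GPUs become free or scarce.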
