Abstract

Datacenters are evolving to host heterogeneous workloads on shared clusters to reduce operational cost and achieve higher resource utilization. However, it is challenging to schedule heterogeneous workloads with diverse resource requirements and QoS constraints. On one hand, latency-critical jobs need to be scheduled as soon as they are submitted to avoid any queuing delay. On the other hand, best-effort long jobs should be allowed to occupy the cluster when resources are idle to improve cluster utilization. The challenge lies in minimizing the queuing delays of short jobs while maximizing cluster utilization. In this article, we propose and develop BIG-C, a container-based resource management framework for data-intensive cluster computing. The key design is to leverage lightweight virtualization, a.k.a. containers, to make tasks preemptable in cluster scheduling. We devise two preemption strategies, immediate and graceful preemption, and show their effectiveness and tradeoffs with loosely-coupled MapReduce workloads as well as iterative, in-memory Spark workloads. Based on the mechanisms for task preemption, we further develop job-level and task-level preemptive policies as well as a preemptive fair share cluster scheduler. Our implementation on YARN and evaluation with synthetic and production workloads show that low job latency and high resource utilization can both be attained when scheduling heterogeneous workloads on a contended cluster.
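To make the distinction between the two preemption strategies concrete, the following minimal Java sketch contrasts immediate preemption (kill the container at once, freeing resources fastest but losing in-progress work) with graceful preemption (checkpoint task state first, then kill, so the task can later resume). This is an illustration under stated assumptions, not the BIG-C implementation: the ContainerHandle interface and the checkpoint() and kill() methods are hypothetical names introduced here.

```java
// Hypothetical sketch, not the authors' code: illustrates immediate vs.
// graceful container preemption as described in the abstract.
interface ContainerHandle {
    void checkpoint(); // assumed: persist task state so the task can resume later
    void kill();       // assumed: terminate the container and release its resources
}

enum PreemptionStrategy { IMMEDIATE, GRACEFUL }

final class Preemptor {
    // Immediate preemption reclaims resources with minimal delay but
    // discards the preempted task's progress; graceful preemption
    // checkpoints first, trading a short reclamation delay for
    // preserved work when the task is resumed.
    static void preempt(ContainerHandle container, PreemptionStrategy strategy) {
        if (strategy == PreemptionStrategy.GRACEFUL) {
            container.checkpoint();
        }
        container.kill();
    }
}
```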
