Improving Spark application throughput via memory aware task co-location

Vicent Sanz Marco, Ben Taylor, Zheng Wang, Barry Porter

doi:10.1145/3135974.3135984

Abstract

Data analytic applications built upon big data processing frameworks such as Apache Spark are an important class of applications. Many of these applications are not latency-sensitive and thus can run as batch jobs in data centers. By running multiple applications on a computing host, task co-location can significantly improve server utilization and system throughput. However, effective task co-location is non-trivial, as it requires an understanding of the computing resource requirements of the co-running applications in order to determine which tasks, and how many of them, can be co-located. State-of-the-art co-location schemes either require the user to supply resource demands, which are often far beyond what is needed, or use a one-size-fits-all function to estimate the requirements, which, unfortunately, is unlikely to capture the diverse behaviors of applications.
