Speedup your analytics

Jiaheng Lu,Yuxing Chen,Shivnath Babu,Herodotos Herodotou

doi:10.14778/3352063.3352112

Abstract

Database and big data analytics systems such as Hadoop and Spark have a large number of configuration parameters that control memory distribution, I/O optimization, parallelism, and compression. Improper parameter settings can cause significant performance degradation and stability issues. However, regular users and even expert administrators struggle to understand these parameters and tune them to achieve good performance. In this tutorial, we review existing approaches to automatic parameter tuning for databases, Hadoop, and Spark, which we classify into six categories: rule-based, cost modeling, simulation-based, experiment-driven, machine learning, and adaptive tuning. We describe the foundations of different automatic parameter tuning algorithms and present the pros and cons of each approach. We also highlight real-world applications and systems, and identify research challenges for handling cloud services, resource heterogeneity, and real-time analytics.
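
To make the experiment-driven category concrete, here is a minimal sketch of random search over a small configuration space. It is not taken from the tutorial: the parameter names (`executor_memory_gb`, `shuffle_partitions`, `compress_shuffle`), their value ranges, and the `simulated_runtime` cost function are all hypothetical stand-ins for running a real Hadoop or Spark job and measuring its runtime.

```python
import random

# Hypothetical search space; names and value ranges are illustrative assumptions,
# not settings of any real system.
SEARCH_SPACE = {
    "executor_memory_gb": [2, 4, 8, 16],
    "shuffle_partitions": [50, 100, 200, 400],
    "compress_shuffle": [True, False],
}


def simulated_runtime(config):
    """Stand-in for executing a real job; returns a made-up runtime in seconds."""
    runtime = 100.0
    # Assumption: more memory means fewer spills to disk.
    runtime += 80.0 / config["executor_memory_gb"]
    # Assumption: this workload has a sweet spot at 200 partitions.
    runtime += abs(config["shuffle_partitions"] - 200) * 0.1
    # Assumption: uncompressed shuffle data costs extra I/O time.
    runtime += 0.0 if config["compress_shuffle"] else 15.0
    return runtime


def random_search(space, evaluate, trials, seed=0):
    """Sample configurations at random and keep the one with the lowest cost."""
    rng = random.Random(seed)
    best_config, best_cost = None, float("inf")
    for _ in range(trials):
        config = {name: rng.choice(values) for name, values in space.items()}
        cost = evaluate(config)  # one "experiment" per sampled configuration
        if cost < best_cost:
            best_config, best_cost = config, cost
    return best_config, best_cost


best_config, best_cost = random_search(SEARCH_SPACE, simulated_runtime, trials=30)
print(best_config, best_cost)
```

In a real setting each call to `evaluate` would launch an actual job, so the number of trials is the main cost. That expense is one reason the other categories named in the abstract, such as cost modeling and machine learning, try to predict performance instead of measuring it every time.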

Journal: Proceedings of the VLDB Endowment
Publication Date: Aug 1, 2019
Citations: 47

Similar Papers

A Survey on Automatic Parameter Tuning for Big Data Processing Systems
Herodotos Herodotou ... Jiaheng Lu
ACM Computing Surveys | VOL. 53
26 Apr 2020

Automatic bi-objective parameter tuning for inverse planning of high-dose-rate prostate brachytherapy
S C Maree ... T Alderliesten
Physics in Medicine & Biology | VOL. 65
01 Apr 2020

Automatic Performance Tuning for Distributed Data Stream Processing Systems
Herodotos Herodotou ... Yuxing Chen
01 May 2022

Beyond Simple Integration of RDBMS and MapReduce -- Paving the Way toward a Unified System for Big Data Analytics: Vision and Progress
Xiongpai Qin ... Hong Chen
01 Nov 2012
