A Control Approach for Performance of Big Data Systems

M Berekmeri, D Serrano, S Bouchenak, N Marchand, B Robu

doi:10.3182/20140824-6-za-1003.01319

Abstract

We are at the dawn of a huge data explosion, and companies therefore have fast-growing amounts of data to process. For this purpose Google developed MapReduce, a parallel programming paradigm that is slowly becoming the de facto tool for Big Data analytics. Although its use is already fairly widespread in industry, ensuring performance constraints for such a complex system poses great challenges, and its management requires a high level of expertise. This paper addresses these challenges by providing the first autonomous controller that ensures service time constraints of a concurrent MapReduce workload. We develop the first dynamic model of a MapReduce cluster. A PI feedback controller is then developed and implemented to ensure service time constraints, and a feedforward controller is added to improve the control response in the presence of disturbances, namely changes in the number of clients. The approach is validated online on a real 40-node MapReduce cluster running a data-intensive Business Intelligence workload. Our experiments demonstrate that the designed control succeeds in ensuring service time constraints.
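
The PI-plus-feedforward structure named in the abstract can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the paper: the first-order toy plant, the choice of cluster size as the actuator, and all gains and nominal values are hypothetical. Service time is the controlled output and the number of clients is the measured disturbance, as in the abstract.

```python
from dataclasses import dataclass

# Hypothetical operating point (illustrative values, not from the paper).
SETPOINT = 100.0        # target service time, seconds
NOMINAL_NODES = 20.0    # cluster size at the operating point
NOMINAL_CLIENTS = 10    # client count at the operating point


@dataclass
class PIFeedforward:
    """Discrete PI feedback plus a static feedforward term."""
    kp: float               # proportional gain (assumed)
    ki: float               # integral gain (assumed)
    kff: float              # feedforward gain on the client deviation (assumed)
    integral: float = 0.0   # running sum of the error

    def step(self, setpoint, measured, disturbance_delta):
        # Error is positive when the service time is above the target,
        # which should call for more nodes.
        error = measured - setpoint
        self.integral += error
        feedback = self.kp * error + self.ki * self.integral
        feedforward = self.kff * disturbance_delta
        return feedback + feedforward  # change in nodes around nominal


def toy_plant(y, nodes, clients, a=0.8, gain_nodes=-2.0, gain_clients=3.0):
    """Assumed first-order model: more clients slow the service down,
    more nodes speed it up. Not the dynamic model developed in the paper."""
    steady = (SETPOINT
              + gain_nodes * (nodes - NOMINAL_NODES)
              + gain_clients * (clients - NOMINAL_CLIENTS))
    return a * y + (1.0 - a) * steady


def simulate(use_feedforward, steps=80):
    """Return the service time trace for a client step from 10 to 15 at k=20."""
    ctrl = PIFeedforward(kp=0.5, ki=0.2, kff=1.0 if use_feedforward else 0.0)
    y = SETPOINT
    trace = []
    for k in range(steps):
        clients = 10 if k < 20 else 15
        delta_nodes = ctrl.step(SETPOINT, y, clients - NOMINAL_CLIENTS)
        # Node count is kept continuous here; a real cluster would round it.
        y = toy_plant(y, NOMINAL_NODES + delta_nodes, clients)
        trace.append(y)
    return trace
```

Calling `simulate(False)` and `simulate(True)` gives the traces with PI alone and with PI plus feedforward. In this toy setting both return to the setpoint thanks to the integral term, while the feedforward term reduces the transient peak because it reacts to the client change before the service time error has built up.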

Journal: IFAC Proceedings Volumes
Publication Date: Jan 1, 2014
Citations: 28

Similar Papers

Big Data Platforms and Tools for Data Analytics in the Data Science Engineering Curriculum
Yuri Demchenko
28 Aug 2019

New authentication concept using certificates for big data analytic tools
Paul J E Velthuis ... Martin Steinebach
27 Aug 2018

Data Science Model Curriculum Implementation for Various Types of Big Data Infrastructure Courses
Tomasz Wiktorski ... Yuri Demchenko
01 Sep 2019

New product success through big data analytics: an empirical evidence from Iran
Farid Shirazi ... Nick Hajli
Information Technology & People | VOL. 35
19 Aug 2021
